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PREFACE 


he flourishing state of the psychology of 
thinking is documented by numerous books 
of readings and reports of symposia on spe- 
X Ag cial topics: concept attainment, cognitive de- 
velopment, problem solving, reasoning, creative thinking, 
judgment, and so on. But the student and the investigator 
are often handicapped by the independence of research 
developments, and an integration of these divergent trends 
is badly needed at this time. Experiments on concept 
learning and cross-sectional studies of cognitive develop- 
ment do have something in common in spite of the differ- 
ences in both terminology and procedure. Judging 
operations are not independent of reasoning operations. 
The principal aim of this book, therefore, is to pull the 
significant trends together into a coherent account of the 
psychology of thinking. 

This volume owes something to an earlier one with a 
similar aim (Johnson, 1955); some of the historical ma- 
terial in Chapter 1 and several figures are taken from 
there. But whereas the earlier volume was limited by the 
research data then available, in the present one the author 
was faced with the problem of selection. The topic of 
thinking has always held a central position in psychology 
and for this reason shares common boundaries with several 
other topics. A survey that claims to be comprehensive 
could easily stray into such neighboring fields as the psy- 
chophysiology of thought, problem solving by animals, 
and the intellectual activities of social groups. Fortunately, 
however, the decision not to trespass has been simplified 


by the good surveys of these fields already available 
elsewhere. 2 


ix 


x YS PREFACE 

Each of the topics included here could be probed more deeply, but a 
compromise between breadth and depth is always necessary. Perhaps the 
breadth of view chosen by the present account will help to loc 
areas for later, more penetrating investigations. 

In the preparation of this book I have been indebted above all to the hun- 
dreds of investigators who have struggled to gain tan 
subtle phenomena of thought, and especiall 
mission for the use of figures and tables. In 
attest to the drafting skill of Robert Almli. | 
students at Michigan State University fo 
casual conversation, that have ; 
data and terminology. Finally, 
also were very helpful. 


ate promising 


gible evidence on the 
y to those who have given per- 
addition, several of the figures 
am grateful to my colleagues and 
r discussions, often disguised as 
helped to straighten out certain confusions of 
the editorial consultants for Harper & Row 


East Lansing, Michigan D. M, J. 
April, 1971 
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CHAPTER I 


HISTORICAL 
INTRODUCTION 


Ideas in Consciousness 

Gestalt Psychology and the Organization 

of Experience 

Thought as an Instrument of Adaptation 
Psychophysics: From Sensation to Judgment 
Tests and Abilities 

Intellectual Development of the Child 
Meanings of Thought, Past and Present 
Recent Theoretical Trends and Countertrends 


„he intellectual pathways that brought the 
psychology of thinking to its present status 
SS crisscross through several centuries. The 
RLN] pioneers who took the first steps could not 
know where they were going, but with the benefit of 
hindsight we can now outline the main routes to the 
present. This will help to show where the psychology of 
thinking stands today and to explain both its achieve- 
ments and its failures.’ 


————— 


Ideas in Consciousness 


As soon as man became self-conscious, he began to 
examine his consciousness. One early result of this self- 
observation was the separation of the mind (or con- 
sciousness) from the body. The psychologies of antiquity, 
at least those of most influence on later psychologies, were 
dualistic in this sense. By the eighteenth century, many 
scholars of our civilization had become accustomed to 


1 Much of the material in the first part of this chapter is presented 
in the standard historical treatments, such as Boring (1950), ae ee 
Murphy (1949), and Peters (1953). Mandler and Mandler (1964) oa y (1959), 
and integrated some of the important historical discussions of thinking e assembled 
g- 
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analyzing their consciousness into ideas: ideas were the elements of thought 
which could be combined and rearranged to make up the complex thoughts 
of complex thinkers. Many convincing principles of the association of ideas 
were worked out by the psychological philosophers (chiefly British) of the 
eighteenth and nineteenth centuries, through the analysis of their own 
mental experiences. 


INTROSPECTION 

The British associationists were philosophers before they were psychol- 
ogists and their method was logical before it was observational, The carcful 
observation of one’s own consciousness was developed in Germany in the 
nineteenth century and nourished by Wundt and his followers. One of his 
most influential disciples, E. B. Titchener, brought the Wundtian tradition 
to America. The Wundtians aimed to make psychology—defined as an 
analysis of the contents of the mind—a precise science by refining the method 
of self-observation, or introspection. 

Classical introspection, as developed by W 
self-examination, which required knowing wl 
ignore. The elements of the mind were Sensations, images, and feelings, and 
training in introspection consisted of practice in identifying these elements. 
The method was experimental in that the 
controlled. 

Wundt did not directly concern himself 
precise introspection of the highe 


undt, was a special kind of 
hat to look for and what to 


conditions of observation were 


with thinking. He believed that 
r mental processes was impossible because 
C1) the experiment could not be repeated exactly; (2) introspection could 
not be directed to the phenomena to be observed: (3) attention could not 
be properly controlled; and CH) the conditions of introspection could not be 
systematically varied. Thought could only be studied through its products, 
such as language. 

Many psychologists did not accept these strictures. Indeed, psychologists 
have always been busy solving problems that other psychologists told them 
were impossible. So Binet, Kiilpe, Woodworth, and others went ahead, gave 
their subjects thought problems, and asked for informal, introspective reports. 
Some of the tasks used to stimulate thought were quite simple: “Give a 
word that means the opposite of compact.” Some were complex: “What is 
the more delightful, the smell of a rose or its appearance?” “Can our thought 
apprehend the nature of thought?” Binet asked his ten-year-old daughter 
just to close her eyes and think of anything. Woodworth asked his subjects: 


r AS EE E nae > 
Should a man be allowed to marry his widow’s sister?” Syllogisms were also 
used. 


THE WÜRZBURG SCHOOL 


A concerted attack 


on thinking was begun about 1900 by one of Wundt's 
students, Oswald Kü 


Ipe. Kiilpe and his followers at Würzburg asked their 


2 Titchener comm 


n a ` nmented on this in a footnote: “I may be obtuse; but I confess I can 
find in this question 


no food for thought [p. 92]” (Titchener, 1909). 


IDEAS IN CONSCIOUSNESS '* 3 
subjects to make judgments (e.g., which of two weights is heavier) and to 
report what went on in their minds while reaching the conclusions. Or, 
using the familiar association method, subjects were given a stimulus word 
with instructions to respond with another word and then to give full intro- 
spective reports. 


IMAGELESS THOUGHT 
One outcome of this work was a controversy centering on the question of 


imageless thought. This controversy—like so many others in the history of 
psychology—illustrates the critical influence of the investigator's psychological 
and philosophical background. The observers who were trained by Wundt 
and Titchener, when they examined their thoughts, “found” sensations or 
images of sensations, even though they had to look hard to find them. The 
Würzburg observers, on the other hand, and observers used by Binet in 
France and Woodworth in America, did not always find that their thoughts 
were made up of images, perhaps because they did not look hard enough. 
Instead of palpable sensations and images, vague unanalyzable experiences 
were reported, which in Würzburg were called Bewusstseinslagen (usually 
translated as “conscious attitudes”). One result of this controversy was a loss of 
faith in the introspective method. Today the controversy has lost its pungency, 
even though there was no officially declared winner. A “final comment” on 
imageless thought was delivered by Woodworth in 1938 (page 788). 


EINSTELLUNG 

Previous work on reaction time had emphasized the preparation, or mental 
set; the reaction itself was considered by several physiologists and psychol- 
ogists as a prepared reflex. Watt, an Englishman working with Külpe, 
applied this insight to the association experiment. Using systematic experi- 
mental introspection, he divided the association experiment into four periods: 
C1) The preparatory period, during which instructions were given (e.g. 
“When the stimulus word is given, name an opposite”); (2) the presentation 
of the stimulus word; (3) the search for the response word; and (4) the 
response itself. Subjects were asked to introspect on only one period at a 
time. The surprising result was that there was little to report in the third 
period: the work of thinking, as far as introspection could disclose, seemed 
to be done in the first or preparatory period. When the thinker is well pre- 
pared, the remainder of the sequence runs off automatically, with little 


conscious content. 

Thus the emphasis shifted to preparation. When a subject accepts the task 
or Aufgabe, he is prepared to respond accordingly. Acceptance of the in 
structions establishes an Einstellung, or, as Ach called it, a detenntnine 
tendency. The term mental set was often used as a syrionym until the 
adjective mental went out of fashion. k 7 


THE SUBTRACTION METHOD 


The reaction time experiment suggested a method of analyzing the mental 
On er f 26 f: ite ai 5 i 
processes that seemed at first quite straightforward. Donders in the 1860s 
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attempted to time the higher mental processes separately by measuring the 
time of a complex reaction and subtracting the simple reaction time. Others 
went even further. The time required for cognition was inferred by measur- 
ing cognitive response time, as when many stimuli are to be differentiated, 
and subtracting the simple sensory reaction time. Judgment time was inferred 
by subtracting association time from total time, which included both association 
and judgment. 

The average times were not consistent, but the chief reason for the 
abandonment of this method was Kiilpe’s argument against the elementistic 
assumption. It is true that complex reactions take more time than simple 
ones. But the change in instructions does not merely add another mental 
process; it alters the whole task. The mental performance is prepared in 
advance, and the stimulus word releases an integrated response. 


SELZ AND THE SCHEMA 


The Wiirzburgers rejected one assumption of associationism, the as- 
sumption that thoughts are always images. Selz, working with Kiilpe but 
influenced also by Kohler and the new Gestalt psychology, went further 
and rejected the associational mechanism. He presented the stimulus 
word and the Aufgabe at the same time on a card—e.g., Poem, Super- 
ordinate?—and asked his subjects for an introspective description of their 
mental activities. The results led him to stress, not the interaction of 
associations, but the integration of the Aufgabe and stimulus word into a 
Gesamtaufgabe (task as a whole). This consciousness of the total task 
guides the production of a complex of relationships, not just a class of 
words in a specified relationship to the stimulus word. 

When the task is difficult, the complex that is integrated is not complete. 
The response word is missing, but it is schematized or anticipated in the 
total Aufgabe. Productive thinking consists essentially of completing this 
incomplete schema. Woodworth (1938) asserted that the anticipatory 
schema “lays claim to a position of fundamental importance in the psychol- 


ogy of thinking [p. 798].” 


THE DECLINE OF DENKPSYCHOLOGIE 


During the first few years of this century Denkpsychologie, the psy- 
chology of thought, had some of the characteristics of an exciting new 
movement that promised to grow and spread to other countries. But this 
promise was not fulfilled. Kiilpe’s interests changed; World War I impeded 
scientific communication, and Germany’s prestige declined. Most important, 
the development of behaviorism cast su 
tions, and this sus 

Denkpsychologi 


spicion on all introspective investiga- 
picion was generalized to a taboo on thought itself. 
e was not a blind alley; at least, no one proved that it 
4 + . 
was so. But other routes to other goals appeared more attractive to active 
psychologists, Th Ë ink 

gists, e problem of understanding thinking has always seemed 


intrigui ival of i i irent 
> gning, but the revival of interest in this topic came from other directions 
and the research followed different approaches. 


GESTALT PSYCHOLOGY AND THE ORGANIZATION OF EXPERIENCE ES 


The contents of thought are often disclosed by reports of conscious 
experience, but the operations of thinking are usually missed and must be 
inferred by other methods. All the methods that are useful for the study 
of thinking today will be examined critically in the chapters ahead. i 


Gestalt Psychology 

and the Organization of Experience 

Gestalt psychology began in Germany with the study of perceptual 
Wundt and the associationists had pictured complex per- 
ceptions as built up by the synthesis of elementary sensations. Wertheimer, 
a student of Külpe, objected, saying that the perceptual pattern is primary: 
rectly, not by adding three sides and three angles. 
Perception is determined by the given sensory data and by the dynamics 
of organizing forces in the brain. These brain forces cause some patterns, 
known as good figures, to have priority over others, just as a soap bubble 
to take the shape of a sphere than a pyramid. When per- 
it is not usually by the accumulation of little changes, 
n of the perceptual field. When a new force acts on 
ange a little piece of the sphere; rather, the 


organization. 


one perceives a triangle di 


is more likely 
ception is modified, 
but by a reorganizatio 
the soap bubble, it does not ch 
whole field of forces is reorganized into a new equilibrium. 

Gestalt psychologists do not differentiate thinking from other activities; 
the laws of organization apply to all. Their early experiments dealt chiefly 
with perceptual phenomena reported by human subjects, but in 1917 
Kohler applied the Gestalt approach to the behavior of apes solving 
problems. His account of these experiments emphasized the perception 
of relationships and insight into the organization of the solution pat- 
tern rather than associative learning of new responses. 

Wertheimer in 1920 Cand again in 1945) described human thinking, 
emphasizing and recommending the reorganization of patterns rather than 
a piecemeal attack on the problem situation. Good thinking, he maintained, 
consists of seeing the structure of the problem clearly and then restructur- 
ing it as necessary. According to Gestalt psychology, those psychologists 
who treat thought as a subhead under learning, and thus try to study think- 
ing by experiments on learning mazes and memorizing words, are doomed 
to failure. Such artificial experiments eliminate the possibility of good 
thinking by depriving the subjects of the opportunity to perceive the 
significant relationships. 

This emphasis on the organization of patterns raised questions in turn 
about the structure of cognitive and social, as well as perceptual, fields. 
It was apparently Koffka who, by expanding the oar of figure and 
ground, popularized the idea of “frame of reference, which has been so 
much used in interpreting political attitudes and other judgments in a 
social context. Gottschaldt showed how patterns can be embedded in larger 
patterns, thus becoming harder to disentangle from the whole. And Lewin’s 
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topological theory of personality dealt seriously with “regions” of the ego 
and the “life space,” their differentiation and overlapping. 


Thought as an Instrument of Adaptation 


The associationists began with the contents of the mind as introspectively 
observed. Darwin published his Origin of Species in 1859; gradually, as 
the implications of evolutionary doctrine became clear, the emphasis shifted 
from the contents of the mind to the mind as an instrument of adaptation 
to the environment. The view of man as an active animal, trying to hold his 
own in the struggle for existence, put the emphasis on psychological func- 
tions or processes rather than contents. 


FUNCTIONALISM 


AND FUNCTIONS 

As early as 1890, William James complained that the introspective 
analysis of thought into elementary ideas was too static; it caught the con- 
tents but missed the processes. In an oft-quoted passage from Principles of 
Psychology, he says: “The rush of thought is so headlong that it almost 
always brings us up at the conclusion before we can arrest it... . The 
attempt at introspective analysis in these cases is in fact like seizing a 
spinning top to catch its motion, or trying to turn up the gas quickly 
enough to see how the darkness looks.” 

John Dewey, another philosopher with a psychological point of view, 
popularized the notion that man's mind and achievements, including rules 
of logic and systems of philosophy, were human solutions to intellectual 
and social problems. In an influential book called How We Think, he 
analyzed reflective thought into five stages or processes that were widely 
accepted as a functional description of a “complete act of thought.” 

Famous thinkers have often been asked to describe how they made their 
discoveries or created their masterpieces. Ifelmholtz, the great physicist 
physiologist-psyeholagist, had done this, as had Poincaré, the great mathe- 
matician. Graham Wallas, a British political scientist whose role in British 
intellectual life was similar to Dewey's in America, was interested in 
thought as a sideline. He wanted te help the average citizen control and 


Improve his thinking, so he stunmarized the contributions af the aesuela 


tionists and the Würzburg School, as well as Dewey's analysis and the 
accounts of the great thinkers. In 1926. in a book called The Art of 
Thought, he systematically described the 
as preparation, incubation, illumination, and verification. 

These descriptions of functions resulted from logical, subjective analysis. 
But experiments were soon undertaken to study and differentiate the 
higher mental Processes. Ruger, for example, in 1910 described the activities 
of his subjects solving puzzles as analysis, conscious variation, and the set- 


ting up and testing of a hypothesis. In 1927, Hollingworth separated out 
judging from Perceiving on the 


various steps in creative thinking 


one side and guessing on the other. Judging, 
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he concluded, was done with more accuracy and confidence than guessing, 


but with less than perceiving. 


CONTROVERSIES IN COMPARATIVE PSYCHOLOGY 

The controversy about evolution stimulated comparisons of the capacities 
of different species. Thus comparative psychology got its start, with some 
investigators trying to find similarities between men and the other animals, 
and others trying to find the differences. Those who claimed to observe 
higher intellectual activities in lower animals were often quite uncritical 
about it, so C. Lloyd Morgan laid down the rule that no performance should 
be attributed to a higher capacity, like reasoning, if it could be explained 
by a lower one, like experience or instinct. Morgan’s canon was supposed to 
of the law of parsimony so often invoked in the other 


be a special case 


sciences. 
Instead of relying on anecdotes and casual observation, Thorndike in 1897 


a problem box and watched them try to escape. He saw 
ideas or reasoning; his interpretation favored rather ex- 
ploratory activity and accidental success. The cats’ performance improved 
with practice because their successful activities were rewarded. This in- 
terpretation, which ca known as trial-and-error learning, was 
stated in terms of associations; but the associations yi between stimuli 
in the problem situation and overt responses, rather fen rs mer 
In England soon afterward Hobhouse, claiming pen horn i had ar 
given his cats a fair chance, ag Ce eat ee 
in a variety < hlem situations, all of which s a at Ter Sips; 
His ar to assert that the higher vertebrates sönd perceive 
vond adaptively to them. Experiments 


ects and resp sila 
d by Yerkes, Köhler, and others, stressing problem 


put some cats in 
no evidence of 


me to be 


relationships between obj 
of this kind were continue 
solvi » jinsi 

lving by insight. 

Thus the argument over 
enjoyed a lively career for h 


. -error was born. Tt has 

ins BEA oi 

alf a century, sometimes contributing moet 
' » iy general, sometimes going off into 

gg of psychology in gen her going 

ie he disputants. There is no need to 

her these interpretations 


ight versus trial-and 
taj 


and results to the progre 
special disputes of no intere 
follow: the argument per se, 


are bound to reappear in la 


DETAVIORISM. AND TES €O 


Working with animals incapable 


st except to t 
but in ene form or anot 


tel chapters: 


NSEQUENCES 

f communicatten: comparative psycholo- 
af t | 

m5 syise their ow ichinique: 

tis FN 7 ‘alogist® were forced to de vi : ABE awit techniques, 
gis : ia experimenta MOLE blet perform enlig tening a on 
and they found that they were 39 ame i 
many psychological topics py strictly objective mao’ Ipen tap ir who 

any psychologica ia i H ‘ith animals, announce e behavior- 

=e i > arking With 4 

began his psychological career working psychologists should not use any other 
ist position in 1913 by declaring that p>? 


- ental psychologists, including those not 
ax perm 2 k ` ae 
methods. For many years enn a Jemonstrated great ingenuity in invent- 
1a 


ifficult psychological problems. The 
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methods of Ebbinghaus in Germany and Pavlov in Russia, for example, had 
been quickly adopted in other countries especially the United States. Thus 
the call for an objective study of behavior received a generally favorable 
reception. 

The implications of the behaviorist approach to thought, however, did 
require some defense. When someone thinks, an outsider may not be able 
to observe any behavior at all until the results of the thought are announced. 
Watson got out of this predicament by equating thought with “inner 
speech.” Speech is behavior, to be sure, acquired as an adaptation to social 
life; but social pressure and ordinary lassitude interfere with overt speech 
under many conditions. Thus the overt speech of uninhibited children 
migrates inward, as it were, to become covert or implicit in the socialized 
adult. Association between one idea and another in consciousness became, 
to the behaviorist, association between a covert symbol for a stimulus and a 
covert symbol for a response. Behaviorists began to define thinking as 
covert behavior, or behavior involving symbols. 

To nonbelievers this seemed like obscurantism, pushing the problem 
inward beyond the possibility of empirical attack. But Watson and his 
followers believed that inner speech movements are not completely covert. 
Some efforts were made to record tongue movements during thought. 
Nothing very specific turned up, either because the recording instruments 
were not sensitive enough or because they were not attached to where the 
thought was. The musculature does show 
when the subject is thinking of somethi 

We must conclude that the 
of thought was a ne 
thought could be re 
show 


more activity in general, however, 
ng than when his mind is blank. 
influence of behaviorism on the psychology 
gative one. Watson showed how some phenomena of 
phrased to fit his system of behavior, but he did not 
how to do anything about it. His criticism of subjective methodology 
scared many psychologists away from the topic of thinking. The distinction 
between thought as conscious content and thought as a problem-solving 
function was not always grasped, so that the taboo against introspection 
spread to a taboo against the investigation of intellectual activities in general. 


THOUGHT ACQUIRES MOTIVES 


One survival of the dualism of early psychology has frequently been 
expressed, and more frequently assumed, as a contrast between opposites: 
between instinct and intellect, or between emotion and reason. Comparisons 
between species, stimulated by evolutionary doctrine, at first reinforced 
this survival of faculty psychology: man has ‘the power of reason; the lower 
animals are controlled by instinct or habit. 

j But evolutionary doctrine also saw man as endowed with the same 
instincts as the lower animals. In adapting to his environment he was 
motivated, like the other animals, toward survival of the species. Fhe unity 
of the organism was explicitly stated by William McDougall when he 


wrote that man’s power of thought was an adaptive biological function at 
the service of man’s instincts. 
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Once the artificial distinction between motivation and thought was 
overcome and the relationship between the two became apparent, it was 
necessary to work out the details of this relationship. Woodworth powered 
this movement in 1914, pointing out the connection between interest and 
ability. Freud, McDougall, and others speculated about the learning of 
new drives, or at least greatly modified secondary drives, on the basis of 
primary drives; and the modern behaviorists are attempting to show precisely 
how this learning occurs in some simple cases. The effects of various drives 
on attention, perception, imagination, thought, and judgment also are 
active topics of experimental ps chology today. 

The Würzburg school discovered the need for dynamic concepts in 
understanding thinking and tried to fill the blank with the Einstellung. 
Because its only credentials were introspective and because psychologists 
were interested in other things, this concept did not catch on. Now, years 
later, when dynamic concepts have become more fashionable and objective 


methods for their investigation have been devised, the Einstellung by one 


name or another is a favorite topic of investigation. 

The disordered mind of the madman has long been an intriguing subject 
of speculation, and many observers of the insane and the neurotic have 
attempted to contrast their irrationality with the thought of rational man. 
The gradual development of Freudian psychoanalysis in the first decades 
of this century helped to shift the emphasis from thought, in the sense of 
rationality, to motivation. 

Because of Freud’s associationist background, he used thought in its 
subjective meaning, as a conscious content. But his shrewd observations 
forced him to describe, or invent, mechanisms by which conscious thoughts 
are disguised and symptoms formed under pressure of the censor, or super- 
ego. The mechanisms are thought processes in a sense: repression, fantasy, 
symbolism, rationalization. But the emphasis in psychoanalysis has always 
been on the drives behind thought, their detours and collisions, rather than 


on the intellectual processes per se. 


THOUGHT ACQUIRES CULTURE 

When the early anthropologists of our civilization, fascinated by the 
theory of evolution, began to study less literate peoples, they interpreted 
primitive thinking as prelogical and irrational. A leader in this movement, 
E. B. Tyler, stated in 1871 that primitive man was animistic; that he looks 
at the world anthropomorphically and endows all things with a conscious- 
ness like his own. But as anthropology gradually became more critical and 
empirical, the differences between the mind of primitive man and that 
of civilized man seemed to dwindle away. The final coup, perhaps, was 
delivered by Paul Radin when he collected many wise and witty sayings 
from preliterate cultures in a book which he called Primitive Man as 
Philosopher. Anthropologists gradually came to state the differences in 
terms of cultural background, interests, and information rather than in 
terms of thought processes, though adequate comparisons between rep- 
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resentative samples of different peoples have never actually been made. 

The early social psychologists, writers like Tarde and LeBon, were 
intrigued by man’s irrationality in social situations. Illustrations of imitation 
and mob violence were collected to show that social influences make people 
suggestible and that membership in a crowd reduces the individual's intel- 
lectual powers to the lowest level. When social psychology borrowed the 
objective methods of experimental and differential psychology, demonstra- 
tions of irrationality became very popular. 

These bizarre overemphases were brought into better balance when the 
textbook writers (for example, Gardner Murphy, et al. in 1931 and 1937) 
undertook their undramatic function of integrating strange facts and diverse 
points of view. The social psychologists’ primitive glee in showing that 
people are irrational, prejudiced, superstitious, suggestible, and undemo- 
cratic was muted by more s 


ystematic attempts to discover the origins and 
operation of values, attitudes, and opinions, whether sensible or foolish. 
Later chapters will show that research on bad judgment has contributed 
in a roundabout and incomplete way to the psychology of good judgment. 

Just as important as the specific results of these developments in social 
psychology is the general effect of socializing all psychology. Emerging 
from his laboratory, the psychologist was occasionally asked by an an- 
thropological colleague how his results would differ if the experiment 
had been done in New Guinea. Careful investigators then began to con- 
sider the implications of their subjects’ 


attitudes toward any experiment or 
test, the instructions given by the 


experimenter, the laboratory atmosphere, 
and the social significance of the responses. In the past, these influences on 
thinking had been treated as nuisances, to be eliminated by careful control 
of the experimental conditions; but a nuisance taken seriously often becomes 
the starting point of more adequate theorizing. 


P sychophysics: From Sensation to Judgment 


Alongside the dramatic rise and fall of psychological schools, experi- 
mental psychology has been steadily expanding its interests and achieve- 
ments. Introspective procedures have come and gone, but psychophysics 
since 1860—when Fechner published his Elemente—has been predom- 
inantly an objective science. 

Fechner’s interest at the outset was 
mystical, He w 
tie the inner 


subjective, metaphysical, almost 
zanted to measure the intensity of sensations so that he could 
Š world of consciousness to the outer world of physical stimuli 
with the firmness of a mathematical equation. As so often happens, Fech- 
ners aims have been ignored, but his psychophysical methods are in daily 
use. Experiments were soon undertaken to determine the acuity of each 
of the senses, acuity being defined eventually as precision of judgment. 
Sensory capacities under different conditions were charted, and anatomical 
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and physiological correlates explored. Fechner’s methods for studying 
preferences for physical stimuli, for example, rectangles of various pie 
portions, were expanded into a new science of experimental aesthetics. 

When psychologists asked how these judgments were made, they nat- 
urally turned to introspection for evidence. Careful observers reported 
images and other conscious contents when comparing a sound or weight 
with a standard, but as in the Denkpsychologie experiments, the results 
were disappointing. When the observer was well prepared, the judgment 
seemed to come directly, as soon as the second stimulus was presented. 
Perhaps the most interesting results were those of Müller and his students 
in about 1900. They noted that after a series of judgments, the observer 
was not comparing the lifted weight with an image of the standard weight, 
but rather with some absolute impression of the whole series for which he 
et or eingestellt. 
psychophysical methods moved toward holding the 
judging process constant, so that the functional relations between some 
aspect of the stimulus and some perceptual consequence could be clearly 
determined. Under ideal conditions, it was assumed that the judging process 
was standardized, and therefore psychophysical data could be published as 
“From the left eye of Observer B” But in the effort to improve the psy- 
chophysical methods, a number of constant errors and other phenomena of 
judgment came to light. And when these were taken as the objects of in- 
vestigation rather than as incidental to the measurement of sensation, the 

as born. 


experimental psychology of judgment wa 
n observed when each of a series 


An important constant error was ofte 
of stimuli was compared with a standard stimulus. A midpoint or center can 
be calculated from the series of judgments, which under ideal conditions 


may be expected to coincide with the standard stimulus. It is called the 
point of subjective equality, or PSE. But it is not always exactly equal to 
the standard stimulus; the discrepancy is called a constant or systematic 
error. Hollingworth discovered (in about 1909) that when several series 
are used, the larger stimuli are underestimated and the smaller over- 
estimated. The PSEs for each series approach the center of all the stimuli 


This phenomenon, which he called the central tendency, and 
n observed in many experiments with 


had gradually become s 
Refinement of the 


judged. 
others called the series effect, has bee 


different kinds of stimuli. 
It ig not necessary to compare each stimulus with a standard. The rating 


| round the end of the last century, applied first to the 
marc aie : 

fl timuli and other complex objects of judgment. When 
r í 


in a psychophysical experiment, for instance, 
1sec bal it E ae j ; ; 
on tl Js, it was called either the method of single stimuli, 

n the loudness of sounds, „omparison, or the method of absolute 


s J » for ¢ 
Since no standard was available a nents seemed to be direct. 
Judgment, since, as noted above, te 


S i i os effect was noticed when the method of 
š ari similar “ th % he affective tone of colors or odors. 
ngle stimuli was used, 3$ 
gle stimuli was used, 


method appe: 
judgment of affective 
the rating method was U 
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Judgment of each stimulus was influenced by the preceding stimulus, in 
fact by the whole range of stimuli. In the 1920s this phenomenon was 
termed affective contrast and affective equilibrium. Then, as Gestalt psy- 
chology extended its influence, the phrase “frame of reference” was loosely 
applied to these facts when they came to light in many fields of psychology. 
In the 40s, Helson developed a quantitative formulation of frames of ref- 
erence: this assumes that the organism adapts to a certain level which is 
a function of all the stimuli and that each single stimulus is judged in 
relation to this adaptation level. Applications of adaptation-level theory in 
several areas soon followed (Helson, 1964); in fact, many of the experi- 
ments on scales of judgment surveyed in the last chapter of this book were 
undertaken in support or refutation of Helson’s concept of adaptation level 

Other quantitative models and hypotheses about judgment were bor- 
rowed from economics, personality research, social psychology, and per- 
ception and applied to more complex situations, such as decisions under risk 
and combinations of stimulus dimensions. Clinical psychologists began to 
treat diagnosis as a problem in judgment. Thus the psychology of judgment 
became more complicated, more quantitative, and more integrated with 
general psychological theory. 


Tests and Abilities 


People have always speculated about the differences between human 
beings, but differential psychology had to wait for progress in general psy- 
chology. The pioneers in this movement, Galton, Cattell, and Binet, in- 
vestigated mental processes before they constructed mental tests. Galton 
and Cattell had been working on simple processes, while Binet had been 
working on reasoning, studying the thinking of his two daughters as well 
as that of chess players and arithmetical prodigies. Hence, he characterized 
intelligence as the ability to understand directions, to maintain a mental 
set, and to correct one’s errors; and the tests he constructed along these lines 
were most useful in the school situation. 

Once the testing movement got started, after World War I, it acquired 
terminology, methodology, and, to some extent, personnel of its own. Hints 
for new tests soon followed from the successes and failures of previous 
tests, as well as from the findings of general experimental psychology. In 
recent years, the testing of intelligence and the psychology of thought have 
Progressed side by side, with only occasional interdisciplinary contacts. 

The generalizations of differential psychology have for the most part 
been of a statistical nature, based on correlations between test scores; psy- 
chological interpretations of the relationships have been secondary and 
tentative. It was noticed early that all tests that could be called intellectual 
are positively correlated. In 1904 Spearman announced his two-factor 
theory, stating that success in any task depends on g, a general ability 
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required for all tasks, and on s, an ability specific to each. Later, he devised 
methods for computing the amount of g involved in each test and the 
amount displayed by each subject. When he came to discuss the nature of 
g, he spoke of a fund of cerebral energy that is manifested in “eduction” of 
relations and of correlates. 

The two-factor theory encountered many objections. In England group 
factors, intermediate between g and s, had to be recognized. In America 
Thurstone developed a more general method that analyzed the correlations 
into a number of factors, none of which need be more general than the 
others. This method results in many factors or abilities, and a whole new 
methodology was developed for identifying abilities and purifying them 
in later tests. The enterprise has been carried forward in recent years 
by Guilford’s (1967) long-term program of test development and analysis. 
His three-dimensional model of the structure of intellect generated many 
hypothetical abilities and guided the construction of tests of these abilities. 
The kinds of intellectual operations performed are: cognition, memory, 
divergent and convergent production, and evaluation; the kinds of content 
on which the operations are performed are: figural, symbolic, semantic, 
and behavioral. And the kinds of products resulting from the operations on 
these contents are: units, classes, relations, systems, transformations, and 
implications. These abilities are all involved in thinking, in one way or 
another, but divergent-production abilities seem to make a special contribu- 
tion to creative thinking. The tests devised by Guilford and his students 
and their factor analyses of the test scores brought new excitement to the 
topic of creative thinking—long neglected by most psychologists and edu- 
cators—and initiated a new research trend. 


Intellectual Development of the Child 


The invention and refinement of intelligence tests in England, France, 
and the United States yielded useful quantitative indices of intellectual status, 
When norms became available, each child’s development could be com- 
with the development of others. But there were 


pared, in a general way, 
ho sought more than a 


always a few people, including Binet himself, w 
general quantitative index. ee 

Jean Piaget was one who rejected the quantitative measures for a more 
qualitative approach and, in the 1920s, initiated a long-term Program to 
chart the stages of the child’s progress toward adult modes of thought. Al- 
though the questions he raised and his research style were unfamiliar, he 
eventually won a wide audience in both psychological and educational 
circles. His unorthodox claims about a ae “igen: that the child 
constructs and through w o knows the world could not long be 


hich he À elih 
% ; jng integrated wit ua 

ignored and are now, in fact, being ği the more familiar 
notions. 
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Meanings of Thought, Past and Present 


In classical times, thought was a goddess who appeared in many faces. 
and today the terms thought and thinking appear in many contexts with 
several meanings which must be carefully scrutinized. The investigator 
can dream of one natural unveiled goddess, but he must work with several 
empirical manifestations. Finer distinctions can always be made, but most 
references to thought and thinking can be grouped under four headings: 
(1) Thought as a topic, (2) as mental experience, (3) as solving problems, 
and (4) as mediation. 


THOUGHT AS A TOPIC 

Thought has often been used in a very broad way to refer to intellectual 
activities, intelligent behavior, or abstract operations, just as coordinate terms 
like perception and personality are used to refer to other broad divisions 
of psychology. In this sense, thought is a nontechnical designation for the 
contents of a book or course of study; such usage does not cause much mis- 
understanding because it is not intended to be precise. The most frequent 
error consists of treating this topic of thinking as a minor topic, sub- 
sidiary to learning, perception, language, or something else. 


THOUGHT AS MENTAL EXPERIENCE 

Thinking also refers to mental experience or the contents of conscious- 
ness, as in the questions: “What are you thinking about?” “What's on 
your mind?” In this sense, a thought is the object of the verb “to think”: 
the passive form is also common: “The thought occurred to me that 
Thinking in this usage is equivalent to attending, so that the above ques 
tions could be rephrased as: “What are you attending to?” When psychology 
was defined as the study of consciousness by the British associationists and 
the Wundtians, this was the standard meaning of thought. 

Any literate person can report the ideas, images, and feclings that come 


to mind while he is smelling a rose, dreaming of past failure and future 
success, 


4 or solving a problem. When thinking has this meaning, a dis- 
tinction is made between reality thinking, which occurs during solution 
of a problem, and autistic thinking, which is regulated by personal hopes 
and fears, i 

The methods of investigating thinking in this sense are necessarily sub 
jective, and the behaviorists, who insist on objective methods, consequently 
discard this aspect of thought. Many others are suspicious of the accuracy 
of subjective methods, simply because the results obtained from different 
subjects and under different conditions are often inconsistent. But there 
need be no uncertainty about what thinking, in this meaning, refers to, as 


long as it is properly circumscribed and differentiated from the other mean 
ings. 
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THOUGHT AS SOLVING PROBLEMS 

Thought, or thinking, is also often used today to refer to activities 
directed toward solution of a problem, in contrast to routine or thoughtless 
activities. “I don’t know what to do; l'Il have to stop and think.” Many 
psychologists prefer the more awkward term problem solving, rather than 
thinking, to avoid confusion with other meanings of thinking. Synonyms 
for the verb “to think” in this sense are: to cogitate, deliberate, ponder, 
reflect, and study. In contrast to thought as mental experience, this mean- 
ing stresses adaptive activities rather than the contents of consciousness and 
does not state whether these activities are or are not open to subjective 
observation. 

Problem solving includes creative thinking, deductive reasoning, and 
judgment as special cas under some conditions concept achievement is 
a kind of problem solving also. The effort to understand a difficult 
a cryptic lecture involves thinking in the sense of problem 
difficulty lies not in the mechanical aspects of read- 
abstract concepts or linguistic structures. 


paragraph or 
solving, but here the 
ing or listening, but rather in the 
Thus the problems of problem solving range from the solution of jig- 
saw puzzles to comprehension of an argument about the meaning of free- 


dom. 


THOUGHT AS MEDIATION 

Thought is the name of a topic in the first meaning above, a mental 
experience in the second, and an activity in the next; but in another sense, 
thought refers to the mediation of activity. If one stops to think, to solve 
a problem, much will happen before the solution is executed, and these 
internal mediating events also deserve the name of thought. 

If the subjective method of observation is assumed to be adequate, these 
mediating events are open to direct observation. Thus thought in this sense 
overlaps thought as mental experience, except that the emphasis is on 
those mental experiences that occur during the solution of a problem. Wil- 
liam James, the Wiirzburg psychologists, the carly behaviorists, and others 
exposed the inadequacies of this method for the investigation of problem 
solving, but there is still some residual confusion between direct sub- 
jective observation of mediating events and theorizing about them. 

Since the solution of a problem often covers a long period when there 
is little relevant behavior to observe, mediating events seem to be theoret- 
ically necessary; but the practice today is to treat them as hypothetical con- 
structs which, like others, should be tested against observable data to con- 
firm their explanatory utility. Various modern behaviorists, in order to fit 
mediators into their general theoretical programs, have assumed that these 
constructs. have the properties of implicit stimulus-response associations, 
observing activities, covert verbal tesponses, and symbols of observable 
objects. Berlyne C 1965) has put it as follows: “Whenever a symbolic 
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response is implicit and has a preponderantly cue-producing function, we 
shall call it a thought. We shall refer to a string of such cue producing 
symbolic responses as a train of thought |p. 27].” 


Theorizing about hypothetical constructs is good clean fun, but it is not 
free-for-all speculation. The empirical validation of such constructs requires 
an explicit statement of the assumptions. Most behaviorists assume, for 
example, that mediating responses are strengthened and weakened as ob- 
servable responses are, that they are elicited similarly, and so on. The 
theoretical links are intended to tie the mediating events to observable 
antecedents and consequents, and thus to explain the overt behavior. If 
thinking means anything to a behaviorist today, this is usually what it 
means. 

Other theorists can play the same game. Hypotheses about mediating 
events also refer to cognitive structures, such as concepts, conceptual sys- 
tems, principles, and logical relations, or, in the context of information 
processing, self-instructions and programs. Several cognitive processes, as 
well as a single associative process, are also conceivable. The theoretical 
status of such cognitive mediators in problem solving is formally similar 
to that of implicit responses but there are differences that have given this 
meaning of thought a somewhat different signification. 

Concepts, for example, were discussed long before the theoretical ap- 
paratus of hypothetical constructs was invented-Education consists in large 
part of achieving concepts, because they are useful in communication as 
well as problem solving. The achievement of concepts is therefore tested 
as a measure of educational progress (without regard to their mediation in 
the solution of any particular problem) under the assumption that they 
constitute standard knowledge, which will be instrumental to the solution 
of many problems in the future. The great distinction of man compared 
to the other animals is this ability to represent things conceptually, by 
words and other symbols, and then to use these symbols in communicating 
and solving problems. ; 

Thought in the sense of conceptual or symbolic processes resembles 
thought in the sense of cue-producing responses since both have the internal 
mediating function, but concepts also function in external social com- 
munication. Whether concepts, principles, and similar cognitive achieve- 
ments should be treated under the same heading as response-produced cues 
and implicit verbal responses may be argued; but they are all mediators 
in a broad sense, and this is what many people mean by thinking, especially 
when they contrast it with immediate response to stimuli. And this must 
also be what people mean by such terms as conceptual thinking and abstract 


thought. 


AN ATTEMPT AT INTEGRATION 


Each of these manifestations of the elusive goddess has its own charm 
and has attracted, at various times, its own circle of worshippers. But the 
best scheme, at present, for embracing the complete meaning of thought is 
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a definition that pulls together the coherent aspects of all these diverse 
manifestations. In this spirit, we can define thinking as problem solving 
and including mediators and mental experience to the extent that they assist 
the explanation of problem solving. Since mediators are unobservable and 
have in the past few decades been notoriously difficult to validate, they are 
an insecure base for definition of anything. Some mediators are pertinent to 
problem solving, but not all; some mental experience is pertinent to problem 
solving, but not all. Concepts are pertinent because the attainment of a con- 
cept may be a kind of problem solving and because concepts are instrumental 
to the solution of many problems. Intellectual history—often called the history 
of thought—documents the reciprocal relationship between the struggles 
for conceptual understanding and the solution of concrete problems; so, 
over a shorter time span, does the educational history of the individual. 
If we include concepts and the solution of different kinds of problems, we 
include practically all the activities that are usually discussed under the 
topic of thinking. But retaining problem solving in the center of the 
definition emphasizes the adaptive function of thought. 

Although an integrative definition like the one just given is plausible 
and convenient for locating both the center and periphery of our topic, 
the experiments and discussions to be reviewed here have arisen in fact 
out of several different conceptions of the meaning of thinking, and clarity 
at these meanings be identified and differentiated. In 


often requires th 
animals, for example, the meaning 


questions about thought in the lower 
of thought as mental experience is not applicable; but when thought means 
problem solving, abundant evidence is available. In discussions of language 
and thought, it makes a difference whether thought means mental ex- 
perience, problem solving, or mediation. The research trend that treats 
concept achievement as problem solving overlaps only slightly with the 
trend that treats concepts as mediators in the solution of larger problems. 
is volume is the contribution of diverse experiments and 
discussions to the topic of thinking. The contribution of each cannot be 
properly evaluated unless we remember that different investigators may be- 
gin with different conceptions of thinking and that the data may be 


relevant to different aspects of the topic. 


The subject of th 


Recent Theoretical Trends and Countertrends 


Early psychological theorists of the armchair type, such as the British 


associationists, were preoccupied with the phenomena of thought. They 
speculated on what happens when someone tries to “think of” a word, how 
one idea suggests another, and why it is harder to think of some things 
than others. Then psychology became more experimental. Experimenters 
found it easier to devise good experiments for studying the behavior of the 
lower animals and the simpler activities of man than for studying man’s 
complex intellectual activities, and the theories followed the same path of 
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least resistance. Recently, thinking, by one name or another, has at 
tracted more attention from theorists. 

We must admit that, in the accuracy of their predictions, psychological 
theories are not very successful. Predictions based on current theories are 
only slightly more accurate than those based on the intuitions of an intel 
ligent layman. But psychological theories, including theories of thinking, 
generate interesting experiments and promise greater understanding in the 
future. Such theories have some value in explaining the behavior of the 
subject of an experiment; they have greater value in explaining the be- 
havior of the experimenter who designs the experiment and interprets 
the results. The three theoretical trends that are most relevant today are the 
cognitive approach, stimulus-response associationism, and the information- 
processing approach. 


THE COGNITIVE APPROACH 


Despite the older meanings, cognition is often used today in a non- 
technical sense to refer to a division of psychology, and in this sense it is 
synonymous with thought. When cognitive tests and cognitive abilities 
are contrasted with noncognitive tests and abilities, this must be the mean- 
ing intended. 


COGNITION AS KNOWLEDGE 


In the more formal meaning that one finds in the dictionaries, cognition 
refers to knowledge or the process of obt 


aining knowledge. Perception was 
formerly included under cognition in 


this sense but it is often treated 
separately today. Knowing where something is in relation to other things 
is one form of cognition; knowing what event is likely to follow another 
event is a second form. Knowledge also consists of knowing names for 
things, classes of things, and relationships between classes. This ab- 
stract knowledge is manifested largely by the use of words and other 
symbols, in accordance with the standards of the culture or of special 
branches of knowledge. New knowledge is acquired by the study of 
words as well as by the perception of subjects. Thus, cognition in the 
abstract form implies escape from the domination of immediate perception, 
and much research on standard concepts fits into this meaning. 

Knowledge and observable behavior are reciprocally related. Yet knowl- 
edge itself is not observable; it is inferred from behavior, most often verbal 
behavior, such as the use of words in accordance with cultural standards, 
answers to questions, and the manipulation of symbols to solve problems. 
Thus, although cognition is an old, established idea, methods for making 
inferences aboùt cognition will have to be examined closely in later pages- 


COGNITIVE THEORIES 

Cognitive theories take many shapes, but in general they emphasize 
the individual’s view of the world: what he knows or believes, what he 
perceives, what he expects, how he organizes his experience. The European 
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phenomenological school raised the question of how the individual ex- 
periences objects and events and how he represents them to himself. 
Gestalt psychology emphasized the organization of experience and used 


cognitive structures, such as the figure-and-ground structure, and re- 
organizations of these to interpret perception and thought. Lewin extended 
the Gestalt principles of perceptual organization to social psychology and 
invoked cognitive structures to explain interpersonal relations, attitude 
change, and motivation in social situations. 

More recent interpretations of attitude formation and change have made 
much use of quantitative relations between cognitions, as indicated by such 
terms as cognitive balance, cognitive congruity, and cognitive dissonance. 
Thus, although the cognitive message came to America with a foreign ac- 
cent, these current cognitive theories have the characteristic American love 


of numbers. 


COGNITIVE STRUCTURES 
Cognitive theories can be contrasted with stimulus-response association- 


ism in several respects (Ausubel, 1965; Hilgard, 1956; Mandler, 1962; 
Scheerer, 1954). While the associationists said that the rat in a maze learns 
a chain of responses from the starting box to the goal box, Tolman—an 
animal psychologist with a cognitive outlook—said that the rat constructs 
a cognitive map and learns where the goal box is in relation to the maze 
as a whole. The theoretical constructs preferred by cognitive theorists are 
cognitive structures assumed to represent the relevant aspects of the en- 
vironment: some of the structures that have already been investigated 
are images, classes, dimensions, systems of classes and dimensions, schemas 
that represent objects, propositions, and relations between propositions. 
According to this view, the environment is known through such cognitive 
structures, and these structures control our reactions to the environment. 
The individual does not just respond to the stimulus that he perceives; 
he constructs a pattern or representation of certain attributes of 


rather, 
and then adapts to the environment as he has constructed 


the environment 
it. Some cognitive investigations examine how an individual develops his own 


set of attributes for construing the data of his experience. And some research 
is focused, not on any particular cognition, but on certain characteristics 


of cognitive structures, such as the number of attributes an individual 


uses to describe a person. 


COGNITIVE PROCESSES 
While associationists talk about only one process, association, cognitive 


theorists speak of many processes, all of which involve organization or re- 
organization of the structure. An external structure may be represented by 
an internal structure, such as an image. A structure may be divided into 
substructures that are meaningful only as parts of the whole. Several struc- 
tures may be arranged in a hierarchy that functions as a whole. Cognitive 
structures are compared with other cognitive structures. Classes may be 
arranged into a system, and a system of classes rearranged into an alternative 
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system. A plan of action may include constituent acts which would be 
meaningless, and perhaps could not be executed, in isolation. 

Cognitive psychologists often do not explicitly identify and describe the 
cognitive structures and processes that they assume, and methods for mak 
ing inferences about these hypothetical constructs are not standardized. 
but certainly the processes differ from the cumulative learning of associa 
tions between stimuli and responses. It is the cognitive psychologists, 
especially, who object to treating thinking as a kind of learning. 


STIMULUS-RESPONSE ASSOCIATIONISM 


Modern behaviorism, except for a few followers of Tolman, depends on 
associations between stimulus and response for the explanation of behavior. 
The principles of learning, developed in simple situations, often with lower 
animals, are assumed to apply, with appropriate elaborations, to the be 
havior of sophisticated human beings in complex situations. But behaviorism 
has altered its shape considerably in order to escape the limitations of 
Watsonian theory. 

In experiments on lower animals, behavior is controlled by physical 
reinforcements—such as food, water, and electric shock—but in experiments 
on human subjects it is initiated and directed by verbal instructions and 
the reinforcement for Success is usually verbal also. Hence, associationist 
theories of human learning have attempted in one way or another to in- 
clude verbal stimuli and verbal responses, , 

Behaviorists, with a few exceptions, now admit hypothetical constructs. 
Although the early behaviorists tried to limit psychology to the study of 
observable behavior, most modern behaviorists are willing to state hypoth- 
eses about unobservable or implicit ’ 


responses that determine observable 
behavior. A hypothe 


tical construct having the properties of a verbal response 
greatly increases the theoretical potentialities of the S-R approach. A 
stimulus word may evoke a response word directly, to be sure; but it may 
also evoke an implicit verbal response, which acts as an implicit stimulus to 
evoke an observable response word, The early behaviorists treated thought 
as inner speech and hoped to get records of minimal speech responses dur- 
ing thought; modern behaviorists, on the other hand, deliberately treat 
verbal mediators as hypothetical constructs, unobservable by definition. 
Thus it appears that whereas Watson tried to explain behavior peripherally, 
in terms of muscular and glandular responses, most modern behavior 
theories explain behavior centrally, in terms of internal unobservable media- 
tors. For this reason, the term behaviorism is now misleading, but the term 
stimulus-response associationism is stil] appropriate. 

Once hypothetical mediators are incorporated in the theory, complex 
activities can be interpreted as due to chains of Ss and Rs. Theoretically, 
the solution to a problem can occur when some response, perhaps an ad 
ploratory one, is fortuitously associated with another response that is Mm 
turn associated with the goal response. 
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In addition to response chains, modern behaviorists often assume response 


hierarchies: that is, a complex problem situation elicits many implicit 
The response with the strongest association 
of less associa- 
problem. This 


responses, varying in strength, 
to the problem stimuli is elicited first, then other responses ¢ 
tive strength. If the first response is rewarded, there is no 
a problem implies that interfering responses are elicited first 
ists of the cumulative strengthening of 
others. Such a hypothetical 


definition of 
and thus that the solution process con 
the correct association and the weakening of the 

validated most convincingly in the case of 
associations can be recorded objectively and a 


hierarchy of responses can be 
verbal problems because word 
Nien associative strengths of different response words independently 

Other modifications of behaviorism have been proposed to apply to 
the important assumption in all of them being that 


complex behavior, 
able stimuli and responses are associated in accordance 


internal, unobserv: 
aws as external stimuli and responses. The pages to follow 


will document the great flexibility of modern S-R theory in meeting the 
challenges coming from other theories, especially cognitive ones. With a 
mechanisms, S-R theorists have been able, at least in theory, to 
of the complex and abstract phenomena that the cogni- 
have forced them to consider. 

asy to see why the older behaviorists preferred 
or problem-solving behavior, rather than 
they talk about thought, 


with the same | 


few simple 
account for many 


tive experimenter 

From this discussion it is e 
to talk about problem solving, 
and why modern behaviorists, when 


thought, 
usually verbal mediation. 
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sume that the solution of a problem involves a hierarchy of alternative 
cognitive structures, is this a cognitive oF an associative approach? 
Psychological theories are very general, and interpretations of the results 
of specific experiments require specific eg ae - well as minor principles 
of limited generality. Thus experimental tests © hypotheses involving cog- 
nitive mediators and S-R mediators seldom lead to results that clearly reject 
either; mathematical theorists have noted that when these alternative hypoth- 
in mathematical form, they are often indistinguishable. 
must account for complex structures of behavior 
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than the hypothetical units from which they are allegedly fabricated. low 
the structures are built up and reorganized, and whether the reorganization 
process is continuous or discontinuous, is what matters. 

The differences between S-R and cognitive views have not vanished, of 
course. There are still psychologists who dismiss any psychological explana 
tion as devoid of theor 
suggest how they are 


if it docs not identify the relevant Ss and Rs and 
ciated, and there are others (e.g., Asch, 1969) who 
assert that the associations themselves require explanation. If one is convinced 
that mediators must have the properties of responses, he will be reluctant 
to discuss abstract or “responseless” 
were reluctant to discuss “imagel 


thought, just as earlier psychologists 


thought. Cognitive psychologists are 
more willing to accept a contemplative view of thought; that some problems 
are solved by just sitting and thinking. Behaviorists insist that a response 
of some sort, somewhere, is necessary. 

Vestigial methodological differences can still be observed, if one looks 
for them. While all students of thinking seem to agree on the importance 


of mediators, some cognitive psychologists are willing to trust the phenome: 
nological method for obtaining evidence about them. Most behaviorists, how- 
ever, even those who speculate extravagantly about implicit responses and 
print many-splendored diagrams of hypothetical associations, consider the 
inference of mediating events a difficult methodological problem and hope 
ultimately to tie them to objective dependent variables. Cognitive theorists 
accept cognition on its own level; behaviorists try to reduce it to something 
else. The behaviorist approach to concepts, for example, asks how the sub- 
ject learns to make a common response to superficially dissimilar objects; the 


cognitive approach asks how the subject categorizes the world of the senses 
and brings order out of chaos. 


THE INFORMATION-PROCESSING APPROACH 


Information as a technical term came into psychology in the 1950s, bor- 
rowed from telephone engineering. A switch that may be open or closed is 
wired to a light in another room that may be on or off. The status of the 
light, which is considered the output of this little communication system» 
gives information about the status of the switch, the input. The information 
in the input may, of course, be quite large, as when there are many switches 
or when the input consists of speech patterns. Then interest turns to the 
capacity of the communication channel to transmit all the information a© 
curately, without distortion. A special index of amount of information, the 
bit, was devised to measure information in the input and output and thus 
to study the amount of information transmitted. 


MEASUREMENT OF INFORMATION 
: Psychologists found such measures useful in standardizing the information 
in the materials presented in concept experiments, in computing the informa 
tion transmitted by judgments on a rating scale, and in analyzing language 
(Miller, 1956). Many experiments gave evidence of an upper limit in the 
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human capacity for retaining and processing information when measured in 
this way. It soon became apparent, however, that the usefulness of information 
measures was restricted by the tendency of human beings to organize in- 
formation into higher units and then to process these as wholes. If we treat 
the individual as an information channel, it is what he does to the informa- 
tion between input and output that is important for the psychology of 
thinking. i 


PROCESSING OF INFORMATION 
Psychologists’ interest in information measurement declined while interest 


in information processing increased, bringing a bright new style to the 
parade of psychological theories. According to this view, the information in 
the environment is coded into categories and represented by a system of 
symbols, such as a set of words or numbers. Thus encoded, the information 
mav be stored in memory and later retrieved and decoded to fit the require- 
task at hand. But once encoded, the information can be recoded 
. The environmental stimuli are not processed, of 
course, but the symbols that represent the information in the environment. 
Information coded in a visual system may be transformed or recoded into 
m of hand signals. Information from the en- 
information retrieved from memory. If 
the information matches, a second operation follows; if it does not match, 
a different operation follows, or the same operation is repeated; some of the 
processes of information processing are: encoding, decoding, recoding, trans- 


ments of the 
and processed in many ways 


an auditory system or a syste 
vironment may be matched with 


forming, matching, scanning, storing, and retrieving. The more complex 
processes can be understood as strategies for getting around the limitations on 
the amount of information an individual can attend to at any one moment. 


COMPUTER SIMULATION 
Information processes were suggested by research on information measure- 


analysis of intellectual operations, and by the poten- 
tialities of electronic circuits, which can recode, store, retrieve, transform, 
and perform many other operations that go by the name of artificial intel- 
ligence. Some information-processing theories have been deliberately written 
in the form of a program for an electronic computer in order to see just 
how far the computer can go in simulating human thinking. Such programs 
are as interesting theoretically as other theories even though they may not 
fit the requirements of any particular computer. It is the black magic of 
computer simulation, however, that has made possible an exciting and easily 
misunderstood approach to the understanding of thinking. 

Computer-based theories, or models, merit description here because they 
differ in intent, power, and certainly in style from the more familiar theories 


that have so long held the center of the psychological forum. Like mathemat- 
stated than verbal theories, 


ment, by the subjective 


ical theories, computer theories are more precisely 
Just as one must make precise statements to permit mathematical calculations, 


one must make precise statements to program a computer. Once the theore- 
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tical details are worked out and programmed for the computer, the program 
constitutes the model, and the computer can be dispensed with—in theory. 
Usually it is necessary, however, to perform the many involved numerical 
operations and realize the model. Furthermore, a computer model has so many 
interacting component operations that after the start button is first pushed, 
the system's behavior, like that of any other automaton, may not be exact- 
ly what its builder expected. 

The great power of electronic computers could, of course, be harnessed 
to any theory that can be stated explicitly. But, as it has happened, most 
computer programs for the simulation of thinking have been based on an 
information-processing approach rather than, for example, an association 
approach, and have aimed at a description of processes rather than the predic- 
tion of results of experimental manipulations. These characteristics may not 
be permanent; Berlyne's (1965) speculations point the way to computer 
models for a modified S-R associationism. 

The most prominent feature of the present computer models for thought 
is their attempt to simulate in detail the output of a human problem solver. 
Other theories attempt to predict group means or proportions of successes 
under different experimental conditions or perhaps the shape of a curve. 
but computer simulation aims to duplicate the problem-solving efforts. of 
an active subject as recorded in a protocol covering many pages. The sub- 
ject’s final solution, or failure, is only a small part of the record. Thus a 
program for simulating problem solving is quite extensive, and it is particu- 
larly difficult for computer model builders to communicate to others what the 
theory says. The model is not neatly wrapped up in a few verbal statements 
or mathematical equations. It consists of many pages of specific details writ- 
ten in a specific language—most often a language claborated to fit informa- 
tion-processing concepts to the requirements and capabilities of the digital 
computer. Thus this is no easy way to instant development of a new theory. 
The writing and checking of a simulation program takes a tremendous in- 
vestment of time and skill. 

Such theories, if they are theories, differ from other theories, and the 
procedures for verification differ correspondingly. The artificial intelli- 
gence of the computer, its capacity for complicated numerical operations— 
after being programmed by a sophisticated human being—does not demon- 
strate that its behavior simulates human behavior. A direct comparison 1$ 
necessary, The observable behavior of the human subject is, of course, his 
oral or written output, while the computer prints its output in the informa- 
tion-processing language; but the two outputs can often be translated into 
the same symbol system so that agreements and disagreements can be counted. 
Furthermore, many attempts at simulation select problems which require 
he human subject to make responses, such as Green square, Turn both 
switches wp, and LLRL, that are easy to compare with the computer printout. 
` Computer simulation experiments generally use only a few subjects and ad- 
just the program to each; the agreement of the computer output and the 
human output is checked for pach subject separately. In another procedure, 
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the two outputs are compared during the first half of a problem-solving en- 
terprise, the necessary adjustments in the program are made to take care of 
the discrepancies, and then the modified program is checked against the 
last part of the human output. One small indication of the success of 
simulation is that, when different problems are solved, the order of difficulty 
for the computer is about the same as for the human thinker (Hunt, Marin, 
and Stone, 1965; Simon and Kotovsky, 1963). 

Certainly more goes on than is written down by the subject. There is no 
place as yet in computer programs for casual comments of the subjects, such 
as “I think you are trying to confuse me,” that may or may not indicate 
something important about the direction of his thinking. But if the elements 
and the moves of the problem can be symbolized without too much difficulty, 
the subject’s output may represent his moves, and perhaps his hypotheses 
and strategies for testing them, fairly well. The hypothetical, covert activity 
as for all others. Any observable output 


is a question for this type of model 
can be generated by more than one set of hypothe 
The hope of the computer experts is that, if the program simulates the 
overt aspects of complex problem-solving behavior in detail, then the informa- 
tion-processing operations that produce the computer's output will be similar 
to the human subject's internal operations that produce his. And these 


al internal operations. 


operations can be examined by examining the program. 

Like other types of theories, computer models lead to new experiments. If 
the program solves a problem by a certain operation, a new type of ex- 
periment may be undertaken to sce if flesh-and-blood people perform such 
an operation. It is not necessary to assume that neurons work the way tran- 
sistors work. Designing an experiment for human subjects suggested by the 
performance of an electronic wizard is no different in principle from design- 
ing a experiment suggested by animal research, or a logical proposition, or 
a theory of social organization. Here, as elsewhere, progress comes from 
the twor ; interaction between theory and experiment. 

The reactions to computer models of thought go both ways, like reactions 
to psychoanalysis, factor analysis, and other new ideas. While some people 
have hailed the dawn of a new era and rushed to adopt computer terminology 
as the mark of sophistication, others have viewed the computer as a dehuman- 
izing machine and, more pertinent here, certain psychologists have perceived 
it as a threat to their theoretical establishment. It is safe to predict, however, 
that computer theories will not displace other theories any more than 
clerical applications of the computer have displaced clerks. Computer models 
do not show that S-R theories, Gestalt theories, or any others are wrong. 
Instead of attacking, computer simulation is outflanking entrenched theories 
and opening up attractive new fields for electronic exploitation. Theorizing is 
becoming more differentiated. 

Two consequences of this innovation take the form of the lifting of old 
taboos. The prohibition of process-tracing experiments, maintained for many 
years by the S-R establishment, cannot stand against the accomplishments, 
limited though they are at present, of simulation models of problem solving, 


26 ¥! HISTORICAL INTRODUCTION 


Secondly, words like thinking and thought, outlawed by some psychologists 
and treated with suspicion by many, are now being rehabilitated. Since the 
pioneer Turing raised the catchy question “Can a machine think?” in his 
celebrated 1950 paper in Mind, the computer fraternity has used these 
terms freely, and psychologists’ language habits cannot resist the prestige as- 
sociated with a machine that makes $400 an hour. Theoretical activity seems 
to be immune to accusations of teleology and anthropomorphism when so 
much of the activity is carried on by an electronic computer. 


ECLECTICISM AND INTEGRATION 

The theoretical approaches mentioned above will be applied to specific 
topics in the pages to follow, but a large share of the research on thinking 
proceeds from an eclectic position that can be characterized in negative terms 
as nondoctrinaire and is not far from functionalism. Many psychologists in 
planning research and discussing their results slip easily from associationist 
language into cognitive language and back again, adding a phrase about 
transformations, with a semantic flexibility that seems irresponsible, or at 
best superficial, to true believers. The eclectic is likely to be more receptive 
to new ideas and new language, even to those that do not fit well with the 
old, than the doctrinaire. The eclectic strategy is organized, not by a grand 
inclusive theory, but around effects or principles of medium size, such as 
functional fixedness, and around the diverse influences of important variables, 
such as word frequency, with the expectation, not usually explicit, of a 
grand synthesis later. 

If we assume that an individual trying to solve a problem will do whatever 
he can under the circumstances, it is easy to say, with eclectic informality, 
that he uses facts, tools, concepts, knowledge of the situation, knowledge 
of common errors, strategies of recall, strategies for processing information, 
and any other tricks he may possess. The controlling influences of attention 
and set were studied from a nondoctrinaire position before the behaviorists and 
the cognitive psychologists broadened their systems to incorporate such 
controls. We could say instead that the situation elicits various response 
patterns, or that the field forces compel cognitive reorganization, or that 
the input information is recoded. The functional manner of speaking, which 
appears to make the individual rather than the situation the master of his 
behavior, is particularly convenient for discussing the polymorphous activities 
to be covered in this book. But it is just a manner of speaking. It does not 
bias the interpretation of the data, and other language could serve as well 
—though awkward circumlocutions would often be required. 

The integration of findings from different approaches is facilitated by 
translation from one theoretical language into another, but the grand synthesis 
is advanced more fundamentally by the identification of common principles. 
To this end, Campbell (1960) has drawn attention, at a very abstract plane 
of analysis, to the wide variations in behavior and the selective retention 
of the more adaptative variations at all phyletic levels and all levels of human 
activity from reflex to creative thought. “A pblind-variation-and-selective- 
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retention process is fundamental to all inductive achievements, to all genuine 
increases in knowledge, to all increases in fit of system to environment [p. 
380].” These variations may be genetic variations retained by natural selec- 
tion; they may be variations in behayíor retained by selective feedback. Large 
cognitive units of knowledge may also be produced in variety, and the 
selection may be cognitive, Based on knowledge gained from previous blind 
variation and selective retention. 

With similar intent at integration but within a more limited range, Mandler 
(1962) has shown how some cognitive structures can be built out of as- 
sociations and Berlyne (1965) has used his neo-associationism to explain, in 
abstract terms, the “structure and direction of thinking.” The integration of 
associationist principles for simple learning with cognitive principles for 
abstract operations offers a possibility of sy S hat is being explored by 


many contemporary students of thought. 
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s happens so often in the development of 
a science, the two terms cognition and con- 
cepts refer to overlapping sets of facts, yet 
È SAS they have not often appeared in the same 
context. Both have had long and honorable carcers in the 
history of psychology, but they have functioned in differ- 
ent traditions and their paths have seldom crossed. Both 
will be more clearly understood when they 
together. This chapter discusses the f 
summarizes the evidence on con 
achievements, mostly from laborate 
will discuss individual 


are brought 
nature of concepts and 
ceptual and cognitive 
y research. The next 
differences and development. 


The Nature of Concepts 


As a simple illustration of the nature of concepts, let 
us consider the case of the child who has divided up his 
environment into things that are animal, vegetable, or 
mineral. We could say that this is a three- 
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part cognitive 
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SR or that he has learned three concepts. Perhaps he has the animals 
urther divided into large and small, the vegetables into edible and inedible, 
and the minerals into soft and hard. This organization, represented below, 


illustrates ; sels AA A a SENG 5 
trates a more elaborate cognitive structure or conceptual system: 


ANIMAL: large, small 
VEGETABLE: edible, inedible 
MINERAL: soft, hard 


llis knowledge of the objects of his world has been organized and, within the 
limits of his ability, he can classify objects according to this system. Further- 
more, he knows the words for each class so that he can communicate with 
other people about them. And he can use this cognitive structure to assimilate 
new knowledge and to solve problems. For example, if he sees a picture of an 
unfamiliar object and is told that it is an animal, he will expect it to act as 
other animals act. Once the child has achieved such a cognitive structure, he 
can make some sense out of what would otherwise be a meaningless world. 
logists have struggled with are: How do 


Some of the questions that psycho 
learn classes? How do they acquire more 


children, and adults for that matter, 
stems? How do they attach words to classes? How 
ss of words? 
pts first because they are the basic elements 
results of concept research will be helpful 


Fie a eal e cognitive $ 
de SNe 4 r 
do they learn the meanir 
l ie a m . . a 
tis convenient to consider conce 


In a systematic treatment of thinking, 


of thought and because some of the 
in later chapters. 

THE SPECIAL INTERESTS OF PSYCHOLOGICAL SPECIALTIES 
first to study 
notion. On the contrary, teachers, philosophers, 


have discussed the concept in general and 
for a long time, and it is only recently 
for empirical investigation of the 


Psychologists were not the the concept; no science or art has 


clusive rights to this elusive 
mathematicians, and other scholars 
the Special concepts of their disciplines 
that sychologists have devised methods [ } 
concept and have tried to fnd a place for it in their theories. i 
Experimental psychologists have been concerned with the learning of con- 
cepts under controlled condit laboratory. They tend to use artificial 
conce to the method, and the experiment ends when the 
ached. These: concepts are therefore defined to fit 


ions in the 
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criterion of learning is re 

this situation. 2 i 

Developmental psychologists have tried to trace the AEE REA e 

the child, charting his conceptual abilities, limitations, and characteristic 
g his To give one example, a prominent trend 


tl ; 

Ought patterns at different ages. d he abstract verbal thi 
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Consists of learning the standard co 
ih course the most difficult. Teacher 
With a single concept, but also with a system 


agt understand and promote learning in 
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-s and textbooks are concerned, not only 
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of a topic of convenient size, such as the nitrogen cycle. Here the definitions, 
methods, and criteria of achievement have all been devised to fit the educa- 
tional situation. 

Thus the psychological specialties have looked at concepts from different 
points of view and have used their own materials and methods of investigation. 
The concepts studied by educational psychologists and tested by educational 
achievement tests, for example, have been more realistic and more abstract 
than those of experimental psychologists. E 
contribute, however, and we shall try 
a general understanding of concepts. 


ach specialty has something to 
to integrate their contributions to attain 


THE SEMANTICS OF CONCEPTS 


Another specialty that contributes to this understanding is semantics, which 
studies the relation between words and what they refer to. In the example 
above, when the child has mastered the concept of vegetable, he uses the word 
vegetable, or some other symbol, to refer to this class of objects. The symbols 
most widely used in intellectual operations are words, of course, and whatever 
a word refers to is its meaning. The function words, such as to, of, and or, 
do not have meaning in this sense; they are used rather as syntactical devices 
to relate different parts of a communication. It is the object words, mostly 
nouns, pronouns, verbs, and adjectives, that have meaning in that they refer 
to other things, such as objects, events, Persons, actions, qualities, and classes. 
Thinking consists in large part of manipulating meaning by manipulating 
words, and the study of thinking requires a minimal an 

In its most general sense, meaning is know 
tions between things. When we inquire 
event, we are inquiring about how it is 
into larger historical trends. When we 
attempt, we are asking how it fits in 
when we ask the mean 


objects and to other words. The meaning of a word is a special case in that 
the word-referent relation is quite arbitrary, but it is culturally standardized 
now what the word refers to. Communication is 
pacities are extended because standardized sym- 
n sentences and formulas, while the things they 
too numerous, or too far away to be easily summoned. 
be abstract qualities, like truth, beauty, and goodness, 
to or picked up but can be designated by words. 

to use words precisely, speak often of terms rather 
combination of several words may have one referent. 
at designates a unitary referent is called a term. These 


alysis of meaning. 
ledge of structure, of the rela- 
about the meaning of a historical 
related to other events, how it fits 


than words, because a 
Any word or phrase th 
may be singular terms, 


of Liberty, 
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different associations; hence it is convenient to speak of an identity class 
to which the singular term refers. 

When a general term is used to refer to a class, the extensional significance 
or meaning of the term is the set of objects denoted by that term. The ex- 
tension, or denotation, of the term dog includes all members of the class, those 
actually living and those yet to be born. The intensional significance consists 
of the properties that define the class and characterize the members of it. Dogs, 
for example, are carnivorous, they suckle their young, they are domesticated, 
they have four legs, etc. In different contexts, these properties are called 
attributes, qualities, and dimensions. 

The use of a term to refer to an object or a class of objects is not true 
or false in any absolute sense. The meanings of terms are a matter of 
definition or agreement between their users, subject to change over time 
and place. Nor is the word-referent relation unique, restricted to one word 
and one referent. All adults use different terms, or synonyms, to refer to a 
single object, a single term to refer to different objects, pronouns in place of 
5 alternative constructions as the situation re- 


nouns 
ter) and the comprehension 


paraphrases, and other 
quires, The usage of words by a speaker Cor wri 
of them by a listener Cor reader), as Olson (1970) has pointed out, depends 
on the knowledge each has of the alternatives in the context. In domestic 
situations, cat means an animal, or class of animals; on a construction job, cat 
means a caterpillar tractor and if there are no alternatives in view, there is no 
ambiguity. But when there are alternatives in the listener's context, two 
cats, for instance, the speaker chooses his words so as to eliminate alternatives: 
“the black cat.” Or if cats have been under discussion, he may refer to the 
same object as “the black one.” Thus the term cat refers to different objects, 


and the object cat is referred to by different terms. This use of alternative 


constructions by the speaker to communicate with the listener is not a special 
P of meaning, and of thought. 


case but a fundamental characteristic of language, i 
Symbols are not entirely arbitrary. Representational signs and symbols 
erties in common with their referents. When we 
Speak of a bumble-bee buzzing around the clover, the word buzzing sounds 
like more than an arbitrary symbol, presumably because the sound of the 
a CO ith the sound of the bee. When the 


word has some attributes in common WI h h 
or R > 
weather man indicates the location of clouds on the weather map, he may 


draw curvy lines that have some visual resemblance $ clouds in the sky. 
The map also has a basic spatial TT e area pe aces 
For this reason, some international road wan aNd} te ; = a y: 
; ; : turns, bumps, and railway 
using pictographs to signal the approach of sharp ” PS; y 
crossings. l h 
5 3 " x explicit characteristi 
The meaning of a word includes, not only the exp ics of 


a class or dimension, but in many cases a suggestive es ne or ‘ast 
i y yar. ue refers to a color 

tion. Dove suggests peace as hawk suggests A jodie sata ut 
also symbolizes a mood. Scientific, legal, an ant ee 
better when the linkages between terms and referents are unique and ex- 


have some perceptual prope 
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plicit, while the purposes of artistic communication are often better served 
by the evocation of suggestive verbal effects. 

“The understanding and use of conventional symbols, such as words, to 
be discussed in this chapter, is the end of a complex interaction between 
maturation and learning. The cognitive development of the child, as he gives 
up his childish ways of representing the objects of his environment and moves 
into the abstract symbolic world of the adult culture, will be discussed sys- 
tematically in the next chapter. 


AN ATTEMPT AT A COMPREHENSIVE DEFINITION 

In past centuries when the concept was considered a generalized mental 
image, there was no problem of definition. But the overthrow of the image 
was followed by definitional anarchy. The notion of an internal event of 
some kind persisted, and various internal stimuli and responses were hypo- 
thesized to replace the image, but none was accepted as definitive. Many 
educated writers use the term concept, without explicit definition, 
to any object of knowledge. Many dictionaries define 
idea, but this does not please psychologists, w 
some good research can be done without 
research from various approaches and any 
take the problem of definition seriously, 
Is it a mental experience, a response s 
a culture trait? And the problem 
definition. How do we know 

Certainly, a concept is not 


to refer 
a concept as a general 
ho cannot define an idea. While 
a good definition, any integration of 
effort at deeper understanding must 
What kind of thing is a concept? 
, a stimulus, a set of objects, a word, or 
of measurement follows the problem of 
that someone has achieved a concept? 


observable; it cannot be an observable stimulus, 


or a set of objects, or an observable response. A concept is an abstraction from 
the world of the senses, and it is this s 


great utility in thought while 
But psychology has to deal w 
The best procedure today 
specifically an ability 
others cannot do. The 
Properties; he can use 
and to solve abstract 


ame abstractness that gives a concept its 
making definition and measurement so difficult. 

ith abstractions. 

is to treat the concept as a hypothetical construct, 

construct. One who “has” a concept can do things that 
literate adult can classify objects on the basis of abstract 
words to communicate with others about abstractions 


mbol is not the concept. If someone asks, “What 
» €veryone understands that the question is not 
he abstraction to which it refers. Each individual 
mself but, if he communicates with and learns 
itive structures through which he views the world 
ts of the culture. 

of a concept as a cognitive structure which each 
à If in his continuing effort to impose meaning on 
the chaotic world of his se = 


enses. It is also reasonable to think of a concept as a 
Piece of the culture. handed down, like the rest of the culture, from one 


= . : 
An to the next. But in most cases, the end results are the same. 
ach individual is confronted with multitudinous experiences that become 
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meaningful during the socialization process as he organizes his cognitive 
structures to correspond with standard usage. It is true that many individuals 
make occasional use of idiosyncratic cognitive structures that they do not put 
into words—the idiosyncratic thoughts of schizophrenics are especially in- 
teresting to clinical psychologists—but these are exceptional rather than 
typical. 

A concept, then, is an abstraction, and achievement of the concept is 
demonstrated by use of the abstraction for classification, communication, and 
problem solving, according to the standards of the culture. One who has 
thoroughly mastered the concept of dog, for example, has the ability to dis- 
criminate ‘dogs from other objects, to describe their characteristics, to under- 
stand and use the word dog in the conventional way, and to apply this 
knowledge when planting flowers or approaching a farmhouse. It is not 
unreasonable to treat concepts as the basic clements of thought when this 


functional versatility of the full-fledged concepts mastered by a normal literate 


adult is emphasized. 


The Structure of Concepts 
A short tour through the arts and sciences will disclose many varieties of 


concepts, but there are four that must be distinguished here: class concepts, 


dimensional concepts, explanatory concepts or principles, and singular con- 


cepts, 


CLASS CONCEPTS 

Knowledge is organized: to know the horizon is to know what is above it 
and what below. A knowledge of the concept of trees includes knowledge of 
they are part of the superordinate class 
sses such as oaks and willows. 
synonyms; we could equally well 
‘re discrete structures that differ 


their place in a cognitive structure: 
of plants, and they have subordinate cla 

In this context, class and seg are 
speak of the categorical concept. Classes are 
qualitatively one Fron another: ae triangles, dogs, picnics, prime mee 
angels, limericks. Unidimensional classes are characterized by one attr ee 
feature: red objects, things that break, sharp things. Most classes sate 
dimensional, characterized by two or more attributes: black dogs, mae : 
angels, sharp red objects that break. As noted above, classes He be a 
extensionally, by identifying the instances included in them, and intens y, 


heory, often 
by describing the attributes of the class. In the language of set theory, 


i seyi oncept can be 
used in computer simulation of concept achievement, a class co p 


defined to designate “either a set of elements satisfying some criterion at oi 

the criterion itself [p. 98]” (Reitman, 1965). The set of clements gives the 

extensional definition and the criterion the intensional definition. 
The important properties of classes can be illustrated by Fig. 2.1, which 


shows a set of cards that were constructed for co A 
attributes, three values of each: the shape of the figures (squares, circles, 


ncept research, with four 


34 = COGNITION AND CONCEPTS 


1.|| jp a zi 0O CI 13. a e || 


M4) cpap gP 


15. ab 


‘I000] -lo o a[O O 


FIGURE 2.1. Cards for research on concept attainment. A 
larger number used by Bruner, Goodnow, and Austin (1956) 


selection of 18 from a 


crosses), the number of figures Cone, 


two, three), the texture of the figures 
Copen, cross-hatched, solid), and the 


number of borders (one, two, three). 

CONJUNCTIVE CLASSES 

In order to be classified 
criteria: he must be above 
class of licensed drivers is 
of two attributes, In Fig. 2 
a conjunctive class com 


as a licensed driver, a person must meet two 
a certain age and pass a certain test. Thus the 
a conjunctive class, made up of the conjunction 
-1, Numbers 3, 6, 8, 11, and 17 are instances of 
posed of squares with double borders. In this case, 
We say that shape and borders are the relevant dimensions; variations on the 
other two dimensions are irrelevant to this concept. Numbers 2, 12, 13, and 
17 are instances of another conjunctive class. What are the relevant attributes? 

DISJUNCTIVE CLASSES 

Less familiar is the dis; 
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Numbers 1, 4, 5, 8, 10, 12, 15, 16, and 18. Another disjunctive class includes 
Numbers 4, 6, 7, 10, 11, 18, and no others. What are the relevant features? 
The reader who works these little problems will agree that disjunctive 


classes are more difficult to grasp than conjunctive ones. 


RELATIONAL CLASSES 

A different type of class is characterized by relations between features 
rather than the presence of features. An example would be married couples in 
which the husband is taller than the wife. Another example, containing the 
same features, would be married couples in which the wife is taller than the 
appear to be abstract but the relation may be one 


E 
The class of isoceles triangles includes triangles that 


husband. Such clas 


that is easily noticed. 
vary considerably in size and shape, but the equality of two sides is readily 
perceived. On the other hand, married couples belonging to the same political 
party would be hard to distinguish from married couples belonging to differ- 
ent parties. In the cards of Fig. 2.1, Numbers 1, 5, 8, 11, 12, 13, and 17 are 
instances of a relational class in which the number of borders is the same 
as the number of figures. Another relational class includes Numbers 3, 4, 18, 
and no others. What are the relevant attributes and what is the relation 
between them? Relational classes, as well as disjunctive classes, are usually 


more difficult than conjunctive classes. 


DIMENSIONAL CONCEPTS 

Cognition of the world consists not F } = ‘eal 
areways : . : nensions. Dimensional con- 
categories but also of locating them on abstract dimensi - c 
common with class concepts, but dimensions are continu- 
Dimensions are continuous properties, qualities, 


or attributes, abstracted and generalized from specific objects and events; y 
wn é 2 i sneral terms. ir usefulness is evident 
are associated, like classes, with general Lers The S Jien di 
when objects are compared in respect to their dimensions and when dimen- 
sional terms, such as helpful, very bad, and bright red, are applied to objects 
and events. Children use categorical te big and me sei one 
at an early age; later, they use dimensional terms like great ee ae 
aly BBEs Hate, WY, pare i ion. use ol 
and tiny, that suggest knowledge of a continuous dimension The use o 
V SS eight for the continuous dimension, is 


only of assigning objects to abstract 


cepts have much in 
ous while classes are discrete. 


rms like 


general terms, such as length and u 
a later achievement 7 
ore i o i ] qualities 
Some of these dimensional concepts are based on salient perceptual q 

d are therefore relatively easy 


such as size, color, loudness, and fragrance, an n 
» color, loud ve acceleration are harder 


P ies like 
to learn. Those based on more abstract properat ag i age pa i i 
to learn, Many of the technical terms listed in the glo: psy Sy 


i . bee ei 
textbooks refer to abstract dimensions: ae Se 
nes PA 5 " E ional classes I ove are 
rigidity, social distance. The multidimens! 


: p i inuous attributes. 
constructed from continuous as well as discontin 


EXPLANATORY CONCEPTS 


In some contexts the word concep 
ciple, such as the frustration-aggress! 


„t means an explanatory concept or prin- 
on concept. A principle is an abstraction 
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of a higher order of complexity than a concept, since it states 
tween concepts—either dimensional or class concepts. The 
equals the angle of reflection; cities grow near harbors. A principle takes 
the form of a proposition and thus, unlike a concept, may be true or false. 

Principles are more important than concepts in the long run. Whereas 


concepts help us to refer to events, principles help us to understand and pre- 
dict events. Much of the research on the le 


a relation be- 
angle of incidence 


arning of concepts applies also 
to principles, but it is convenient, because of the nature of the research 
trends and the problems of measurement, to p. 
and give them special treatment | 
and again under reasoning. 


ass over principles at this point 
ater under the heading of problem solving 


SINGULAR CONCEPTS 


A singular term refers to a single object or event, and in some contexts the 
word concept, or conception, is used for cognition of the object or event 
designated by a singular term. It is standard usage to speak of one's concept 
of the moon, or D Day, or the Taj Mahal. The referent of a singular term 
may be a cluster of memories, perceptions, affects, and 
direct experience and on communication fro: 
moon, for example, may be acquired by 
in different phases, under different 
also on pictures in books, lectures 
exploration. But all these attributes 
association with a singular term. 

Singular concepts have 
but, as one illustration, Sal 
the conceptualization of a 
angles, 


associations, based on 
m others. One's concept of the 
direct perception at different times, 
atmospheric conditions. It may be based 
on astronomy, and news reports of lunar 
are organized and held together by their 


not been studied as thoroughly as class concepts, 
tz and Sigel (1967) were 


able to plot age trends in 
single 


person from photographs taken at different 


Procedures for Studying Concept Achievement 
Psychological specialties with 
different procedures for studying 
of objects and ask someone to cla 


to label them with a word that r 
word and ask someone to 


different interests in concepts have devised 
them. It is possible to begin with a variety 
ssify them, to state how to classify them, or 
efers to the class. Or one can begin with a 
apply it to objects, to give its meaning, or to carry 


g 
out other verbal operations according to standard usage. Only the more com- 
mon procedures will be briefly described here as a guide to understanding 
the results of such studies, 


Classes are frequently manufactured for concept experiments from familiar 
attributes (such as size, sh 
periments of this kind ther 
they do include leaming w 
how these attributes are co; 


p: 
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PROCEDURES FOR STUDY 


METHOD OF PRESENTATION 

Tests of knowledge of concepts present the materials once only, while 
experiments on the learning of concepts present the materials repeatedly. One 
method of studying concept learning, the reception method, presents cards, like 
those in Figure 2.1, to a subject one at a time. His task is to classify them by 
stating that cach is a positive or a negative instance of the class the experimenter 
has in mind. The experimenter usually tells the subject after each response 
whether he is right or wrong. Another method, the selection method, lays all 
the cards on the table simultaneously. The experimenter identifies one card 
as a positive instance, then the subject is permitted to select other cards and 
ask the experimenter if each is a positive or a negative instance. By either 
method, the experiment continues until the subject has reached a specified 
criterion of learning. 

The subject of an experiment by the reception method is in a position 
rather like that of a student in class listening to the lecturer present the 
information. The selection method is analogous to the situation of a student 
who has the same information available but studies it in his own way. As 
second is usually more efficient, other things being equal 
1956; Reed, 1946), but occasionally it works 
the selection method makes greater demands 


one might expect, the 
(Bruner, Goodnow, and Austin, 

the other way, apparently because 
on the subject (Murray and Gregg, 1969). 


TESTS OF CONCEPT ACHIEVEMENT 
omeone has acquired a concept? There are many 
precisely because concepts have so many func- 
tions in intellectual life. The usefulness of each depends on ee of the 
concept and of the function to be studied. The scientific yield fronr an a 
of course, when concept achievement is measured y 
able, and in fact multivariate designs are be- 


How do we know that s 
tests of concept achievement 


periment is greater, 
more than one dependent vari 
Coming more common. 


f making a common response to 
rficially dissimilar, and a different 
pulate that the objects be dis- 
ut the identity response. For 
ed according to the shape of 
heir relevant attribute, shape, 
attributes. 


CLASSIFICATION 
The classification opera aie 
objects that belong in a class but are § i sti 
; : +. necessary 
Tesponse to other objects- It is necessa Fortes 
WA y n a) 
Similar in some attributes in order to classifi 
k are 
example, when the cards of Fig. es ilar in t 
: ee ila 
the figures, the positive instances are Sup jeant 
pu or, irre ; 
but dissimilar in respect to the other, vimportant. Experimental subjects can 
The nz f th ssponse is unin à dtl K 
ature of the respc Jass of objects and that on the 
ae s ith one class dae 
associate the key on the right with ly well use the terms dax and not 
left with another class; or they can equal k ot in making the response but 
or A and B. The conceptual achie 
in making it to certain objects rathe 
Conventional concept learning exp 


tion consists © 


ments require attempts at classification 
ert 
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from the beginning, confirm correct responses, and then plot progress as an 
increase in the number of correct responses. Hence the response made asa 
test of achievement is the same as the response made during practice. There 
are exceptions to this custom, however, Some experiments (these will be de- 
scribed) have used the classification test following practice by other operations. 
Many concept learning experiments present the same materials for classifica- 
tion repeatedly until all, or nearly all, are classified correctly. In such cases, 
it is customary to add a generalization or transfer test with new materials in 
order to eliminate the possibility of rote learning of specific instances. 
Classification is a test of extensional meaning since instances that belong 
in the class are identified separately. A test of intensional meaning requires 
a description of the class, by naming the relevant attributes and by the rule 
of combination. These two aspects of intensional meaning are not the same. 
It frequently happens, for example, that a subject learns the relevant attributes, 
such as shape and number, correctly, but learns the wrong rule for combining 
them; he acquires a conjunctive concept whe 


n a disjunctive concept is correct. 
The classification operation is often part 


of a concept learning experiment, 
but it is used also as an educational achievement test: 


example, specimens of tissue are presented and stude 
class to which each belongs. 

Hundreds of experiments on concept learning hav 
the classification operation, so many in fact that this 
the definition of a concept, within experimental psychology at least. When the 
classification test is the only one used, 
resemble those on discrimination |e. 
The learning can be conceived 
stimuli and another to the neg; 


in zoology classes, for 
nts are asked to name the 


e been designed around 
operation has influenced 


experiments on concept learning 
arning with human and animal subjects. 
as associating one response to the positive 
ative stimuli. One can go further and define 
the concept as a common response to dissimilar stimuli (Kendler, 1961), but 
this definition is appropriate only for experiments by this method. Asking the 
subject to state the basis for the classification would not be a test of concept 
achievement by this definition, nor would the other tests listed next. 

Another approach to the definition of concepts, when classification is the 
test of achievement, is to treat the concept as the identification of the relevant 
attributes and the rule of combination. According to Haygood and Bourne 
(1969), a concept can be understood as a principle by which objects or events 


can be classified as positive and negative instances. This definition, referring 
to intensional meaning, 


is more realistic than the common-response definition, 
but it is still based on the classification operation and would not include other 
tests to be listed next. The classification of objects is certainly an important 
test of concept achievement, but it is not the only operation of interest and it 


1s not sufficient by itself for a general definition of concepts or for a compre- 
hensive evaluation of concept achievement 


ORDERING 


The ordering operation, for testing achievement of a dimension, is analo- 
gous to the classifying operation 


but more complex. Some intelligence tests 
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present a child with four blocks, of the same size, to be arranged in order of 
weight from lightest to heaviest. The effect of knowledge of a dimensional 
concept on learning a class concept has been investigated by giving children 
a dimensionality test, which requires them to arrange six blocks in order of size 
and six other blocks in order of brightness, prior to the learning task (Johnson 
and White, 1967). Successful performance requires knowledge of the dimen- 
sion as well as the ability to follow the instructions; failure could be due 
to lack of either, Unlike artificial classes, artificial dimensions are not easily 
manufactured for learning experiments under controlled laboratory conditions. 


VERBAL OPERATIONS z 
When one learns to respond to square figures with double borders by the 


word dax, we can sav that he is learning a discrimination, a class, a category, 
se. But, from another point of view, we can say that he 
is learning the meaning of dax. In the context of the experimental situation, 
the term dax refers to square figures with double borders. The versatility of 
concepts allows the testing procedure to move from objects to words, from 
words to objects, and from words to words. A child may be given a singular 
term, such as moon, and asked to tell what it means by pointing to the object 
in the sky or to a picture in a book. If a student is given a general term, such 
as evergreen, he can demonstrate that he knows the extensional meaning (by 
pointing to examples in view or by giving names of — the in- 
tensional meaning (by listing the attributes of the class). The student may 


give a synonym, a superordinate class, or a subordinate class, or he may use 
dnas st of this kind requires the subject to 


of logically related elements (Natadze, 


1962), Many abstract concepts of science and law are meaningless unless 
they are placed within a structure that includes other abstract concepts. i 
These verbal tests are applicable, with slight modifications, T terms that 
designate dimensional concepts as well as class concepts. lri : tion ee 
can be ordered in respect to a dimension of their nee oe ae 
is asked to arrange the names of metals in order of hardness, g 


ee ical order. 
the ng n: chologists in chronologica f ; ' 
names of eminent psycholog' e in multiple-choice 


i ificati nay b 
Tests of word usage, like tests of classification, may ones 
et i ond form, which requires scoring by 
form for objective scoring, or 1N open-en , r paa teke 
expert judges. But the agreement between two in =— ju ees sig 
definitions of concepts and the use of — en oe kee 
Mei 4. 

favorable conditions (Johnson and O'Reilly, e ae i 
1966). And many open-end tests can = ai 5 rw a o 
ski ice ve alternative ’ 

for example, by asking for a choice from 


or a common response. 


the word in a sentence. Another te 
arrange words in levels as a structure 


ENERAL 5 
used to study concept achievement at any 


I 7 s are s i 
2 n general, these er llege course, OF to record decreases in time and 
ne time, as at the end of a co i 
errors q gi t learning in the laboratory. The purpose may Dit either 
uring concept sss to measure differences between individuals, 


to investigate general principles or Piet wes ; : 
although the ee wet wos minor modifications. Individual differences in 


TESTING PROCEDURES IN G 
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itiv le are disclose: » the nature of the 
approach or cognitive style are disclosed by the iar sh 
people form when they are merely asked to categorize a v 
objects, by the number of groups they form, 


oups that 


aricgated set of 
by their preferences for classify- 
ing by one dimension rather than another, and by their flexibility in shifting 
from one dimension to another. 

The achievement of a concept is a quantitative 
a concept thoroughly, others partly, othe 
nature certainly appears in the study of 
sciences, if not in the study of easy concepts. The term number, for example, 
can be variously defined at the level of the fifth- 
Freshman, or the graduate student of mathematical logic; similarly, botanical 
classifications that are acceptable at the fifth-grade level may not be 
at the graduate-student level. Th 


above tests can be raised or low 
scoring standards. 

If a concept is viewed as an 
that successful performance on 
by direct association. When the 
sponses should be mediated by knowledge of the cl 
specific positive and negative instances. Whe 
these also should be mediated by 
ora hackneyed illustration. 
failure, when it occurs, c: 
concept rather than to in 
perform the response, 
noncognitive 


thing: some people know 
rs only slightly. This quantitative 
difficult concepts of the arts and 


grade pupil, the college 


acceptable 
ic proportion of subjects who pass any of the 
ered by adjusting the test materials and the 
ability construct, the methodology requires 
any test be mediated by this construct, not 
test requires the classification of objects, re- 
ass, not by the memory of 
n it requires verbal operations, 
knowledge, not by a memorized definition 
Likewise, the arrangements should be such that 
an be attributed to lack of achievement of the 
ability to make the perceptual discrimination, to 
to remember the terms, to pay attention, or to other 
components of the performance, 
Each of these methods of measurement h 
infallible. None is more fundament 
epistemological priority, 
is more convincing th 
correct, the results of 
Scores above ave 
another test of 


as certain advantages; none is 


al in any way than the others; none has 
A convergence 


an the results of 
different tests wil 
rage on one test of 
achievement of the 
©wn specific sources of difficulty, 
the difficult one for som : 
test may be the dificul 
able hypothesis that, 
test would be correlat 


of the results from two or more tests 
one, and if the present discussion is 
l converge. That is, the person who 
achievement will score above average on 
same concept. Each measure contains its 
to be sure. The classification test may be 
€ concepts (e.g., schizophrenic), while the definition 
t one for others (e.g., number); but it is a reason- 
if adequate tests were available, performance on one 
ed with performance on the others. 


Achievement of Figure Classes 


The bulk of the evidence on co 
on the learning of cl 
Projected on a scree’ 


ncept achievement comes from experiments 
asses of stimulus patter 


ns or figures printed on cards or 
n. Hence we 


will summarize the principal results of 
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research on figure classes first. Later, we will turn to word classes and number 
classes and will see that the results can be applied, with some exceptions, 
to these other types of material. 

The concept research of the past few decades has been strongly influenced 
by the pioneer experiments and interpretations of Clark Hull (1920). Other 
psychologists had presented various figures to their observers and had tried 
to use the introspective reports to derive generalizations about discovery of 
the features of a class. Hull put his investigation in the form of an objective 
learning experiment. The stimulus objects were printed patterns, resembling 
Chinese characters. The class to be named li, for example, contained two 
sloping lines joined at the top, embedded in different complex patterns of 
lines. Hull used the familar method of paired associates so that he could 
plot progress in terms of an increase in the number of correct responses. The 
subjects were expected to associate a name with each character and, with 


practice, to learn the correct name for each class. Finally, they were asked 


to draw the identifying feature of each class. 
its | d some objective results that have been verified 
in later experiments and are still under es ra It wis = 
common, for example, that a subject could learn to shat y seg correctly 
but could not draw the feature that all positive instances had in common. 
Hull noted also that learning could be speeded up if the gone pane s 
the class was made more salient by coloring it red. But the method and the 


b tant contributions. 
more important contr: . idb died 
ed that concept learning cou e studie 


such as the verbal response, li, to 
common feature but dissimilar in 
the response to each stimulus 


These experiments produce 


interpretation were the 
Hull’s experiments demonstrat 
as the association of a common response 
-dine iy respect to one 

figures that are similar in respect t a 
association theory, 
ciation between the correct response and 
ances will be systematically strengthened 


whenever a correct response happens to peann mo p — id 
the experimenter. Simultaneously, the associa x a At. eini tes 
sponse and other elements of the stimulus D dcaile abstracted from the rë 
the common feature of the cl k 


g only feature that is always 
Mainder of the stimulus pattern because ; 


h : A » response. 
associated with confirmation OF eee C1924) pointed out that such a theory 
A few years later, Edna Heidbredet 2 
years later, na 


~ ceiver of information from the 
treats the individual as a rather pa ive =e artial memory of positive 
environment, contributing little to the task but i p tea. Hews: the: irdi 
and ERRA The contrasting theory, she noted ; s m 
a Negative instances. EOSS ati ypotheses abou 
vidual age on re active part, formulating ‘hype "0 Sa 
al as taking a more a l ances. Much of our present knowl- 
co > sm against later instances. M is 
ncept and testing them age : come from attempts to support one or the 
i as ti i 
brenar one restricted conditions, mathematical 
certa a ; 
ee puter simulation becomes possible also 
and Cor 


other respects. According to 
will be random at first. But the as 
the critical feature of the positive inst 


ass will be 
it is the 


edge of concept achi 
other of these theories. 
models can be developed 
(Hunt, 1962; E. S. Johnson, 1964). 
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TASK VARIABLES OR CONDITIONS OF ACHIEVEMENT 


Aside from theoretical enterprises, considerable solid data on concept 
achievement has come from atheoretical attempts to improve the methods 
of studying concepts and to illuminate the conditions that influence concept 
achievement. We shall consider first the conditions that are known to have 
strong influences on concept achievement; they might be called general task 
variables. Then we shall return to some attempts at theoretical explanation. 


THE QUESTION OF ABSTRACTNESS 

“To abstract” means to pull out or separate something from its entangle- 
ments, and this is what Hull’s subjects had to do in order to classify Chinese 
characters according to the presence of a common attribute. In this sense, all 
concepts are abstract. But Hull’s classes were formed from concrete attributes, 
directly visible, while other classes are formed from more abstract attributes. 
The relational classes mentioned in connection with Fig. 2.1 are examples of 
more abstract classes since each is characterized by a relation between attrib- 


utes. Forming classes of classes is an even more abstract accomplishment. 
When we compare the Jones system for classify 


system for classifying roses, we are operating at a 
than when we are classifying roses by either syste 


ing roses with the Smith 
higher level of abstraction 
m. 

The hypothesis that the more abstract concepts are harder to learn is 
reasonable, and yet experimental demonstrations of this hypothesis have not 
been as clear-cut as expected. In one of sever 
Heidbreder (1947) used line drawings which could be sorted into three 
classes. Thing classes consisted of common objects, such as a hat; form classes 
consisted of nonsense figures; and number classes, supposed to be more ab- 
stract because they are based on numerical relations, consisted of three, four, 
or six small designs, In general, thing classes seemed to be the easiest to 
acquire and number classes the hardest. According to Heidbreder, the per- 
ception of concrete objects in man is the dominant mode of cognition be- 
cause his locomotive and manipulative capacities have priority. The more 
classification resembles perception of concrete objects, therefore, the easier it 
becomes. In later studies, however, this dominance hierarchy as it was termed 
was found to be a weak one, often obscured by other tendencies CHeidbreder, 
1948; Heidbreder and Overstreet, 1948), 

Further evidence on this question comes from the Wisconsin Card Sorting 
Test, which requires sorting by form, color, and number classes but does not 
include a class of concrete objects. In one study by this procedure, number 


classes were easier than form classes (Grant, Jones, and Tallantis, 1949). 
The Inconsistency in research 


: here is probably due to the selection of mate- 
tials to be classified and to procedural differences. The importance of the 
eriment by Wohlwill (1957) that included two 
epend on abstraction, the other on conceptuali- 
bstracted their own dimensions and classified 


al experiments on this topic, 


procedure comes out in an exp 
procedures, one intended to d 
zation. When the subjects a 
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cards on the basis of these, form classes were used more consistently than 
color classes. But when the experimenter told the subjects which dimension 
to use and measured the sorting times in seconds, with discriminability con- 
trolled, color gave better results than form. 

The question that Heidbreder raised did not receive a satisfactory answer 
from experiments with figure classes and remains still open. It comes up again 
in experiments with word classes, in studies of cognitive development, and 


in tests of cognitive style. 


POSITIVE AND NEGATIVE INFORMATION 

Is a No the opposite of a Yes? This question has intrigued psychologists 
for decades—since Smoke (1933) reported the results of his experiments on 
positive and negative instances. The subjects were presented with circles of 
together with dots in various positions. Those 
dot outside were to be classed as dax. 
plus sign and each negative 


different sizes and colors, 
circles with one dot inside and one 
Each positive instance was presented with a ; 
instance with a minus sign, but the information for learning the class 
than from the negative instances. 
es were often too general; 
to sharpen the concept de- 


seemed to come more from the positive 
Furthermore, classes formed from positive instanc 
the contribution of negative instances was mostly 
rived from the positive instances. 

This demonstration is not altogether convince 
not obvious what the subject could be expected to get from each instance. 
an be arranged, however, by telling the subject 
in advance what dimensions and what values of each are to be presented. 
Suppose the subject is told, for example, that there will be eight ie 
varying in size (large and small), shape (circle and triangle), an o 
Cred and yellow). These can be represented by their initial letters as: y 
Clarge red circle), LRT, SRC, SRT, LYG, LYT, SYC, and SYT. 

Let us say that the class to be learned is a conjunctive class of two dimen- 
sions and the subject sees LRC with the information that it is a page 
stance. He can guess that L and R are the relevant attributes, renra B 
would also be positive, or he might guess that L and C are t : yee 
attributes, hence LYC would be positive. The other possibility is that : 
C are relevant, hence SRC would be positive. Thus this — ha 
logically narrowed the positive instances down to four out of eig — m 

Suppose, on the other hand, that the first card presented nee Ae 
the subject was informed that this was a negative instance. He cou : 4" y 
exclude the SY, ST, and YT combinations and thus eliminate re RT, 
and LYT. Counting SYT, this information has logically oa oe our out 
of eight possibilities and four out of eight remain, eny t : rie be in 
the previous example with a positive instance. The number o alternatives 
was cut in half, hence in technical terms the amount of information in each 
instance was one bit. Yet the positive information is more useful to the 


average subject than the negative- 


ing, years later, because it is 


More clear-cut experiments ¢ 
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More complex sets of materials require more complex calculations, but 
the amount of information in positive and negative instances can still be 
equated. When experiments were carefully designed in this way, it was 
shown that negative information was utilized to some extent but not as much 
as positive information (Hovland and Weiss, 1953). Negative instances 
were more helpful when all instances were presented simultancously than 
when they were presented successively. 

These findings have been verified by other experimenters. Apparently, most 
people distrust negative information; it is indirect. Choosing one attribute 
because of the elimination of two others requires an additional operation, 
with the risk of making a mistake, and requires also the firm faith that 
there are only three attributes to be considered. This goes counter to ordinary 
habits of thought because life teaches us to avoid closed systems and a 
statement from the experimenter that there are only so many dimensions may 
not be enough to overcome such old habits. This is probably the reason why 
simultaneous presentation increases the use of negative information. 

Whatever the explanation for the reluctance to utilize the information in 
negative instances, it can be definitely reduced by practice. Freibergs and 
Tulving (1961) equated the information in their instances, as Hovland and 
Weiss did, but they gave their subjects advance instructions about using 
negative instances, as well as positive, to discover classes. Then one group 
practiced on positive instances and another on negative; the classes based on 
positive instances were learned quickly while those based on 
stances took longer. But, as Fig. 2 
about equal in number of errors. 

Some recent hypotheses about difficulties in concept le 
recommendations for pretraining in 
we shall see later; this procedure 
of the two kinds of inform 


negative in- 
.2 shows, after 20 trials the 2 groups were 


arning have led to 
analyzing information in the situation, as 
also reduces the differences between the use 
ation (Bourne and Guy, 1968; Davidson, 1969). 
Such results do not erase the differences that others have found, but they 
demonstrate that the difficulties of negative information can be reduced 
by thorough instructions and a little pretraining. 


When we turn from conjunctive to disjunctive classes, the contribution 
i negative instances is quite different because it 
t at a i otiv, ar i i i 
ee ees informative. (The reader can verify this statement by going 

o the disjunctive problems of Fig. 2.1.) Hence Chlebek and Dominow- 


ski (1970) arranged an experiment similar to that of Fre 
but with disjunctive 


Bae classes and found that those who were shown only 
negative instances discovered the concepts much earlier than those who were 
shown Positive instances. In addition, as in the case of conjunctive classes, 
the differences between the value of negative and positive information de- 
creased as practice continued. z 

Although we can describe the factors that influence the difficulty of con- 
cept learning, such as the nature of the class, the kind of information, and 
the stimulus materials, one by one, complex research designs have demon- 
strated that the effects of these factors are usually not independent: The 


is the negative instances 


ibergs and Tulving 
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Median time to solution (in seconds ) 


li ke he 
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FIGURE 2.2. Solution time for 20 concept problems for subjects with Tt instances 
and other subjects with negative instances. From Freibergs and Tulving (1961). 


i ive instances ¿é isjuncti s is onl 
special relation between negative instances and disjunctive classe: y 


one illustration of this. 
avan aan o o a two relevant dimensions 
The acquisition of a conjunctive class ased on 
re : ld obviously be more difficult than that of a class 
i Ý i i uch as 
based on only one. And a class based on three relevant pen 5 a ja 
size, shape, and color, would be even more difficult to grasp. ut t epe as 
: R : 
of irrelevant dimensions raises difficulties also. if the > sie co : 7 
shape, and color, and the class to be learned is the class of black circles, 
Large and small objects are positive instances; 
arge a c 
í ative i . Increasing the amount of 
large and small objects are negative instances r A c à 
relevant information in the materials presented for i see ra 
dificul and so does increasing the amount of irrelevant informa- 
ifculty, of course, and so s 4 A Beane. TAM, Th 
tion CHaygood and Stevenson, 1967; W al er tan 3 > x ; ese 
effects are not peculiar to visually perceived objects, because parallel ex- 
3 a =, . . P . 3 
periments can be done with auditory stimuli by varying the frequency, 
intensity, steadiness, presence of background noise, and location of the ear- 
AA, ? 


such as size and shape, wou 


size is an irrelevant variable. 
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phone through which the sound is heard; here again the difficulty of classi 
fication depends on the amount of irrelevant as well as relevant auditory 
information (Bulgarella and Archer, 1962). 


SALIENCE AND SIMILARITY 

In his early concept experiments, Hull (1920) was able to help his sub 
jects abstract the critical feature out of the complex patterns by making it 
more salient, that is, by outlining it in red. This is an old trick often em 
ployed in teaching anatomical and other spatial relations. A similar device 
to emphasize the figure is to vary the background so that the figure stands 
out. The classification of patterns is easier when the critical figure is shown 
with varying backgrounds than when the same figure is shown with a con- 
stant background (Turnure and Wallach, 1965). Likewise, if the critical 
feature is a simple drawing of one or two curved lines, combining three of 
these in a complex pattern makes classification 


quite difficult—as predicted by the Gest 


according to any one feature 
alt principle of good continuation 
CWallach, 1962). 


When synthetic classes are systematically constructed of familiar attributes, 
such as size, shape, and color, abstraction from background entanglements is 


not difficult but it is still possible that the values of one attribute may be 


more impressive than the others. The difference between a large and a small 


square, for example, may contribute more, or less, to the difficulty of the 
task than the differences between a white and a black circle. This “question 
can easily be examined by varying the difference between the two values 
of a binary dimension. Archer (1962) arranged a set of designs in which 
the two squares, for example, differed by %4 inch and another in which the 
two squares differed by % inch. Although the smaller difference was large 
enough for easy discrimination, in a set of objects that varied in many 
dimensions, some relevant and some irrelevant, the large difference was more 


a 
obvi ; 3 anean. FI 
vious and was more readily grasped as a basis for classification. The re 


sults of several experiments of 
the optimum condition would 
formation and minimize the 
comprehensive statement see 
ceptions. When the set of 


this nature led to the general statement that 
maximize the obviousness of the relevant in 
obviousness of the irrelevant information. This 
ms reasonable enough but there are some ex 
mare Wil we be stimuli included a few irrelevant features that 

J » they were quickly examined and eliminated from further 


consideration: i ji 
: ion; learning was actually faster than when these irrelevant fea 
ures were less salient C 


This factor of salienc 


s. Rommetveit (1960) required children 
wings that resembled the outlines of a 
gnition test as well as a sorting test. Ie 
shape just exposed when sorting he children had learned to recognize the 

§ was not yet above the chance level of ac- 
metveit and Kvale, 1965), attempts to 
hapes, early in the learning process had a 
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This factor of salience does not necessarily negate the findings reported 
above because it is common practice to rotate dimensions within a 
But when classification by one dimension is compared with classification by 
another, or when concrete classes are compared with abstract classes, the 
factor of salience becomes critical. Classification by color could be made 
easier than classification by shape merely by increasing the color differences; 
equally, classification by shape could be made easier by increasing the sha i 
differences. l ° d 

In general, we may say that the salience of a dimension may be an in- 
trinsic property of the stimulus patterns or it may be a consequence of prior 
experience. Various kinds of practice directed toward increasing knowledge 
of the stimulus patterns prior to discrimination learning, technically called 


stimulus predifferentiation, do facilitate concept learning (Tighe and Tighe, 


1966). 


MEDIATED ASSOCIATION AND CONCEPT SHIFTS 
concept, based on introspective evidence, 


The classical discussions of the 
this is analogous to a composite 


treated it as a generalized mental image; 
photograph, which emphasizes the common feature of a set of faces and 
blurs the specific features. If such a generalized image is aroused by percep- 
tion or recall, present instances can be compared with it and judged as mem- 
bers of the class or not. 

Modern psychology distrusts introspective ev 
of a mediator as a hypothetical construct to be validated by behavioral data. 
Hypotheses about mediators, to fill the gaps in stimulus-response behaviorism, 
are as old as behaviorism itself (Goss, 1961). The mental image, which was 
buried alive a half century ago and is now being revived, can also be con- 
sidered a mediator between S and R. Hypothetically, this mediator has some 
of the properties of the stimulus pattern; other mediators have been hypo- 
thetically endowed with some of the properties of responses, words, or cog- 
nitions. 

Modern hypotheses about 
ciation with two-unit associatior 
the S on one side and the R on t 
separately. The mediator acts as @ 
next R, thus: S 
the chain of associations m 
the nature of mediation hy 
account for the simple behavior of lower : 
mediated associations are more plausible for complex ieee ; 

The need for a mediated association theory 18 apparent when the subject 
classifies figures to fit the experimenters criterion tone gn tects a 
then the experimenter shifts his giitengn: “ae ty Mkea SP Se 
ample, that the figures to be sorted vary in size, shape, and color, and that 
choice of Jarg fon : is rewarded, shape and color being irrelevant. After 

arge figures i a hout informi a 
the subject ne correctly’ the experimenter may, wit rming the 
s g ? 


idence but permits the notion 


mediators replace so-called single-unit asso- 
a. The mediator is pictured as associated with 
he other, the two associations being acquired 
response to S and a stimulus or cue to the 
R. This picture is probably too simple; 
er. But two-unit association illustrates 
While single-unit associations might 
animals and young children, 


ay be long 
potheses. 
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subject, shift and reward on some other basis in order to analyze the cla ssif'y: 
ing process more thoroughly. There are two types of shift in this situation: 
a reversal shift requires the subject to shift to the opposite value of the 
same dimension; that is, if large figures were correct, small figures are now 
correct. This is also called an intradimensional shift since size is still the 
relevant dimension. A nonreversal or extradimensional shift, 
hand, requires the subject to shift to some 
angular figures of any size or color will be called correct. 

The behavior of children and adults following such shifts led Howard 
and Tracy Kendler and their associates (Kendler and D'Amato, 1955; Kend- 
ler and Kendler, 1962) to a series of new experiments and an attempt at 
theoretical explanation in terms of mediated association. A reversal shift 
seems to necessitate the relearning of all associations between figures and 
responses: what was positive is now negative and vice ver 
ments have shown that reversal shifts 
than nonreversal shifts, though nonreve 
young children. 

Consider a simple situation in which 
subject has to learn names for cl 
first response is an orienting one 
the tight. This response brings ir 
implicit cue or mediator for the 
terminal naming response, 
chain, including the me 
ened. More often, of co 
location. The critical a 


on the other 
other dimension; perhaps tri- 


. Yet the experi- 
are easier for adult human subjects 
rsal shifts are casier for animals and 


a few figures are displayed and the 
asses. It is reasonable to suppose that the 
, for example, looking toward the figure at 


nto view a figure, which in turn acts as an 


of head and eye movements but 
be described as observing co 
mediating response w 
confirming response j 

This mediation hy 
ments resembling tho 
and Learnard (1962) 
stimuli are used: one 


off atmo e test phase are required. In Phase I, the two pairs 
i | i 
muli are presente: and different subjects are rewarded for 
he basis of one of the four stimulus ele- 


the large white square in one pair and 
rewarded. After the subject 
is phase, Phase II presents only one 
r choosing the previously negative stimu- 
as correct in Phase I, black is correct in Phase II. 


pair, and the subject 


is rewarded fo 
lus; for example, if w 


hite Ww: 
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But the choice in Phase II is ambiguous because the small black square could 
be chosen cither for its color or its size. If the choice is for color, the subject 
has reversed from black to white, hence this would be called a reversal shift. 
It is also an intradimensional shift, since the subject continued to orient to 
the color dimension. If the choice is on the basis of size, it would be called 
a nonreversal or extradimensional shift. Phase III is intended to determine 
which it was. After the subject is responding consistently in Phase II, the 
other pair is reintroduced as a final test. If the subject chooses on the basis 
of color, presumably he was using color throughout Phase II; therefore he 
reversed. 

The mediation hypothesis states that the subject orients to a dimension, 
such as color, and a value of this dimension, such as white, is associated with 
the next response in the chain. If the final instrumental response is mediated 
by orientation to this dimension, a shift on this dimension (e.g., from black 
to white) would be easier than a shift to some other dimension (e.g., from 
black to small). The difficulty of the nonreversal shift is that it requires 
learning to orient to a different dimension. The hypothesis that a reversal 
shift is easier defies intuition and simple single-unit associationism, because 
after a reversal shift an association that was right becomes wrong and one 
that was wrong becomes right. A number of experiments on reversals have 
shown that, while the simpler associationism may be adequate to explain 
the discriminative behavior of animals and young children, the mediation 
hypothesis, or something similar, must be added to it to explain the be- 


havior of older children and adults. 
Kendler, Kendler, and Learnard (1962) included different age groups in 


their experiment and found that children of ten made more reversal shifts 
than the younger children. They take this result as evidence that the older 
children have verbal mediators, such as the words color and shape, that help 
them maintain their orientation to a dimension. Such research results do not 
mean that the young children lack the vocabulary to describe the simple 
i these as implicit cues for the instru- 
as older children do. Tracy Kendler 
number of reversal shifts at younger 
for the positive 


concepts used. Rather, they do not use 
mental response, at least not as much 
€1964b) has been able to increase the 
n children to verbalize labels e 
and negative stimuli: “The square is the winner and the circle is the loser, 
or “The black wins and the white loses.” Kendler believes that “at this de- 
velopmental stage overt verbalization has the same effect that the model 
ascribes to covert verbalization among more mature human Ss [p. 434]. 


: ree 7 hifts are often easier 
The results of other experiments agree that reversal sł en 
in terms of the mediation of 


than nonreversal shifts, but the explanation r i ` i 
implicit cues is not beyond dispute. An experiment with college students by 
1 al superiority of the reversal 


Isaacs and Duncan (1962) disclosed the usual s 
n comparison with a control group 


ages by requiring kindergarte 


Srou , but i 
Stoup over the nonreversal group, : ; 
the reversal group showed a significant negative transfer. They interpreted 


the results in terms of nonspecific transfer of ag o tasks of se ae 
and a specific transfer which may be negative, on the dimension reinforce 
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in training. Erickson (1971) was not satisfied with the general a 
reversal shifts are easier than extradimensional shifts even for adu s > 
reported, after a series of experiments, that the nel oe yen 
completeness of the instructions. When he gave col ege = ents a w 
concept problem with the usual brief instructions, the extra pot w aa 
problem was learned after about three times as many errors as the ne 
shift. With more thorough instructions—explaining in a general way how to 
use the information given by confirmation and nonconfirmation—the dif- 
ference in diffculty was in the other direction and the reversal shift required 
about twice as many errors. The activities of the subjects with brief instruc- 
tions seemed consistent with mediated association theory, while those with 
thorough instructions were more consistent with a hypothesis-testing model 
Cto be described next). 

A reversal shift is an intradimensional shift, but an intradimensional shift 
is not necessarily a reversal shift. To keep things simple, many experiments 
have used only two values of each dimension: the colors are red and green, 
the shapes square and triangle. But a dimension may have more than two 
values. If red figures are positive and green figures are negative, a shift to 
yellow-positive and blue-negative would be an intradimensional shift but not 
a reversal. Thus when classes are constructed from four colors and four 
shapes, there are three types of shift to be compared: extradimensional Cor 
nonreversal), intradimensional, and reversal. 
P. J. Johnson (1967) disclosed that reversal sh 
college students, causing only 
to the shift. Intradimensional 
dimensional shifts were quite d 
the mediator is rather general, 
dimension. Reversal from red. 
ficulty as a shift from red-gree 
of attention, one could say 
rather than a specific color 

Reversal shifts in children of four, six, and eight can be increased, as in 
college students, as a result of verbalization that calls attention to the correct 
dimension (Jeffery, 1965). Tighe and Tighe (1968b) also taught young 
children to make reversal shifts, simply by giving them predifferentiation 
training to familiarize them with the independence of the different dimen- 
sions, Thus these writers find it possible to account for much of the research 
on discrimination learning in terms of perceptual differentiation: learning to 


distinguish the different stimulus patterns is more important than learning 
the responses to them. 


The design of the early experiments emphasized specific stimuli, cues, or 
bipolar values of a di í 


mension; but recently it has appeared that the dimen- 
sion can be the medi p 


Systematic comparisons by 
ifts were relatively easy for 
8.00 errors on the average before they adjusted 
shifts were also easy, 9.58 errors. But extra- 
ifficult, 22.75 errors. Such results suggest that 
a dimension rather than a specific value of a 
green to green-red was about the same in dif- 
n to blue-yellow, To put these results in terms 
that the subject learns to observe color Cor shape) 


Cor shape). 
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search designs, whatever the final interpretation of the results may be. At 
present it seems that the results can be interpreted alternatively in terms of 
selective attention, perceptual differentiation, and verbal mediation (Kendler 
and Kendler, 1966; Tighe and Tighe, 1968a; Wolff, 1967). This research 
trend may be more relevant to attention, perception, and discrimination learn- 


ing than to concept learning. 


SELECTING CUES AND TESTING HYPOTHESES 

The more active approach to concept learning, which Heidbreder (1924) 
described many years ago as a search for common features, attributes more 
complicated problem-solving activities to the subject of a concept experiment. 
Single-unit and mediated association theories assume that the subject is alert 
and observant, to be sure. But the more dynamic theories assume that the 
subject takes the initiative by looking for similarities in a series of instances, 
formulating hypotheses about common features, and testing these on later 
instances. It has been known for some time (Reed, 1946) that instructing 
the subject to look for common features facilitates concept learning in com- 
parison with merely instructing him to learn correct responses. Thus, asso- 
ciation theories portray concept learning as a continuous improvement in 
performance, the correct associations being gradually strengthened, while 
hypothesis-testing theories portray it as a discontinuous process. Formulating 
and testing an incorrect hypothesis would produce no consistent improve- 
ment; formulating the correct hypothesis, however, would be followed by 


rapid improvement. 


SELECTION AND RECEPTION STRATEGIES f 
The hypotheses that people formulate and the procedures by which they 
test them were examined by Bruner, Goodnow, and Austin (1956) in a series 


of experiments from which the cards of Fig. 2.1 were taken. They looked at 


concept attainment as the coding of experience into categories. The sub- 
ject’s task is to discover which attributes of the perceived objects are useful 
cues for distinguishing the objects that belong together in a category from 
those that do not. One way to study the subject's categorizing efforts would 
he to ask him to describe what he is doing, but this method is open to soe 
criticisms and has not, in the past, yielded convincing evidence. If zr su a 
Ject is doing his best to discover the concept, he cannot also take on t e ex- 
Perimenter’s task of scientific investigation, at least not without some sacrifice 
in efficiency of one or both tasks. If the dimensions are familiar ones, like 
size and shape, however, and the subject chooses to test a hypothesis that the 
class is a simple combination of these, then statement of the hypothesis is 
not too much to ask. Sequences of responses can, of course, be objectively 
recorded and analyzed; and the subject’s statements about his hypotheses can 
be checked against his actual responses. f l 

As one can see from Fig. 2-1, familiar attributes were used in these ex- 


periments. The difficulty consisted not in learning the attributes but in the 
cognitive strain or memorv load involved when the information pertinent to 
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a hypothesis was gathered over a period of time from a series of instances. 

Any method of collecting pertinent information and organizing it so as to 
evaluate a hypothesis is called a strategy. When the selection method of 
presentation was used, with all cards exposed, the experimenter helped the 
subject to get started by giving him one positive instance. After that, the 
subject selected one card at a time and was told whether it was a positive 
instance or not. The experimenter could reconstruct his strategy fairly well 
from records of cards selected, and, in addition, the subject was encouraged 
to hazard a hypothesis after each selection of a card. Thus, in this type of 
experiment the subject knew the materials in advance and what he was 
supposed to do. What he did not know was the concept the experimenter ex- 
pected him to acquire. 

The possibilities for approaching such a problem are numerous; 50 people 
working on such a complex problem could use 50 different strategies. For- 
tunately, the variation is not that large. It is possible to describe a few logically 
ideal strategies, and then to observe whether the subject's behavior conforms 
to one of these types. One ideal strategy would be to collect and integrate all 
the information simultaneously and work out a solution logically. But this 
simultaneous scanning, as it is called, is bevond the memory capacity of most 
people. None of the subjects of this experiment, who were college students, 
really tried it. A more feasible strategy is to begin with a hypothes s, and 
then to select cards that yield information pertinent to it. This strategy, which 
is called successive scanning or part scanning, because the subject needs to 
scan and remember only the part of cach card that is relevant to his hypo- 
thesis, was frequently employed by the students. It has the disadvantage that 
the subject concentrated only on part of what he saw and was not likely to 
learn much while he was following a hypothesis that proved to be wrong, 
although under some conditions some subjects were able to remember a little 
even about cards irrelevant to that hypothesis. 

A vay efficient strategy known as conservative focusing consists of begin- 
ning with the first positive instance and collecting information around it. 
ae has to have a correct instance to begin with, and in this experiment the 
yer ite À paa aaa instants as a starter. He did not understand at 
ainme We ah a positive instance but he could find out by main- 

5 us on the card and selecting another card that varied from 


the known itive i i 
eines i e Instance in only one attribute. If this second card was 
Ive Instance, the attrib i i 
ibili a ute varied mus leve has 
med ust be irrelevant, and t 


vinced. Tf fhe ty ae ye (Some subjects were not so easily con- 
the subject then chose a ated al ~ two red squares with one border and 
was told that this card was fon two green squares with one border and 

also positive, he had logically eliminated color 


as an attribute of th ; 
e class. But a few subjects preferred to choose a card 


with two black squares and 
one border next, į p 
i i : iba , Just to be doubly sure.) If the 
card chosen was a negative inside: ) 


relevant. In either case, this focus; the attribute varied must have been 
j ia ocusing is wholistic: ne 
itive i 5 olistic; th o: sed on 
the first positive instance as a akale e hypothesis is ba 
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These st i i 
se strategies were resti ad i ini 
ou ee p 5 investigated in the context of attaining figure con- 
Se : nr of the ssn learned here may be applied elsewhere. The 
-e between simultaneous sci i j i Si 
a et A A ES : scanning and conservative focusing should 
a a s er : e asachi Ten. : K 
ee e who has searched the scientific literature in order 
se anv npepe because he is bothered by the same effect of too much 
rmation. One cou at i 
ar pte ye = go through the recent issues of a likely journal 
aie} i f ponder ele “et Or one could, as soon as he came across a useful 
article, focus on it and then choos i : is 
: a se other articles i e li i 
Se teeta on S Eee AES | ag cig light of the informa- 
s hrs re instance. Eac i E 
tages, but the second is more efficient in general k in ri adas 
Conservative wholistic focusing requires à D itive ¥ 
sad a Ue patience: (E the ed ae a I ositive instance to begin with 
s a hurry EET 
chance, he may change two E CARGUES si f and is willing to take a 
Tia 2 a sa $, his is a risky ki q 
sailed focis dabing ce oll e a ete is a risky kind of focusing, 
Ges cece £ a a solution sooner by thi aiea 
if he makes a lucky guess; but if he makes a bad guess = ig 
saute mme es : a bad guess and se 
negative instance, he will not know which of the ae hi ae 
is responsible for th r H ay ev concen wtiribnras 
4 ie error. He may even forget the positive focus d. A 
meee f jane OE s card. 
he names imply, conservative focusing was followed by moderate gai ar 
losses, while focus ge bli cas followed i ins ee one aa 
= s gambling was followe by big wins and costly erro 
PSN ; X a \ rs. 
M: he other situation, in which the subject receives information regular] 
but c c meee ; cei ey 
annot choose what information he receives, was also studied by hess 
reception strategies. They used the method 
e experiment more like a conventional 
Id not pick his cards but he could 
t the cards as they came to him. The 
fied in their choices. The wholistic 


strategy, concentrating on the first positive card as a focus card, was found to 


be more frequent and more efficient than the part scanning strategy. 
Similar experiments by Bourne (1963) and others have also found the 
focusing, or wholistic, strategy to be more efficient, but it is 


most frequently used. Most subjects can profit from informa- 
but many errors come from the failure to remember 
enough to see their implications for testing a hypothesis 
1960). This is one reason, no doubt, why concept 
presentation takes longer than by simultaneous 


experimenters in order to describe 
of successive presentation, making th 
learning experiment. The subject cou 
formulate a hypothesis and test it agains 
game two types of strategies could be identi 


conservative 
not always the 
tion just received, 
earlier instances well 
(Cahill and Hovland, 
achievement by successive 
presentation. 

The effectivene 
efforts to use it in te 
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grade children tried to 
to the simpler procedure 
Stances, The fifth-grade students, who use sim 
performance. When university students, however, 

sblems, it was the 


ș for solving concept pro 
he most successful (Klausmeier and 


yservative focusing strategy has stimulated 
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but soon reverted 


the positive in- 
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The generality of these descriptions of information-processing strategies is 
complicated by the results of experiments by Bruner, Goodnow, and Austin 
(1956) with thematic materials. The thematic cards bore drawings of people 
who differed in sex (male or female), in dress (day clothes or night clothes 
and other attributes. Concept attainment with a set of these materials went 
slower than with a parallel set of meaningless abstract figures. The subjects 
used similar strategies, but they were distracted by reasonable and familiar 
hypotheses about possible groupings. They clung to certain attributes, classify- 
ing by sex, for example, after these attributes had logically been clim- 
inated. 

In general, the wholistic focusing strategies were more efficient than the 
scanning strategies, especially under difficult conditions, as when time pres 
sure was applied and the display of information restricted. But the ques 
tion of efficiency is an ambiguous one that cannot be clearly answered 
unless we know the subject's motivation. In these concept attainment tasks and 
in the tasks of later chapters, the motivation may usually be described as 
cognitive. The subject tries to maximize his successes and minimize his 
errors. Yet some situations emphasize the attractiveness of success and others 
emphasize the cost of error. In the experiments of Bruner, Goodnow, and 
Austin, the subjects adjusted their approaches to the situation at hand with 
some flexibility, adopting a risky strategy when they had only one try and a 
more conservative strategy when they had several tries. When the experi- 


menters arranged conditions so that the subjects’ guesses were right, they 
adopted more risky strategies. 


PROBABILISTIC CUES 
After a system of categories has been thoroughly mastered, each element 
ak the system—that is, each value of each Himension-—seeeis to belong in 
its proper place. But during the learning process an element is only a cue 
that may or may not be a criterion to define a category. Most experiments 
are planned so that each feature has exceptionless validity as a cue to the 
Category. If a class of triangular objects is to be learned, choices of other 
objects are never confirmed. Considerable difficulty is introduced when the 
pi of a cue is neither 100 percent or 0 percent. In such experiments, 
adie bes ba aae pi of the choices of triangular objects 
Goodnow, and Austin "Reseed dar sil e Pie om ani 
siderably and the difficulties d variations slow down learning id 
maed. ue to the factors mentioned above are mag 
e les eee of cues from 100 percent to some less proba- 
: may seem, nor is it an arbitrary scheme to con- 
fuse the subjects. Life is like that. Most of the classification systems that 
106 has to learn could be described as useful but less than perfect. The 
percent condition is an interesting special case, but probabilistic con- 


itions are 
ditions are more common and must be considered not only here but in 
other chapters to come. i 


ACHIEVEMENT OF FIGURE CLASSES “ 55 

It is worthwhile at this point looking back to observe how far this type 
of concept research has progressed from the early experiments by Hull. 
Instead of associating names with figures exposed at regular intervals in a 
memory drum, the subjects try to solve a problem, to discover the defining 
attributes of a class, with a considerably flexible approach. Since they know 
the dimensions of the problem in advance, they can proceed more or less 
rationally and adjust their strategies to the information given. Their re- 
ports of their hypotheses are consistent with their hypothesis-testing moves 
(Schwartz, 1966), and those subjects who are explicit about their strategies are 
the most efficient (Eifermann, 1965). The subjects examine the stimuli and 
choose their responses, not only by association with a reward, but in part 
by balancing probability of success against probability and cost of error. 
The series of experiments by Bruner, Goodnow, and Austin, which was 


called A Study of Thinking, described the subjects’ activities in such terms 


as inference, choice, and strategy, and thus played an influential role in 
ativelearning framework to the 


shifting concept research from the associ 
framework. Subsequent inquiries have 


problem-solving, decision-making 
improvements and variations of both 


been concerned with methodological 
Practical and theoretical significance. 


A TWO-PROCESS DEMONSTRATION f ; 
he instructions and materials of the experiments described above were 


chosen so that the concept would be acquired in an active, problem- 


solving n , the gradual strengthening of associations. 
g manner rather than by g s 
the other. The re- 


But the demonstration of one process does not negate l 

lation between the two concept attainment processes was clarified by 
Podell (1958) by means of differential instructions and more complex 
T eri sy . à , i 
Matende During the learning phase, both of her groups m a 
neaningless figures; but one group was instructed to evaluate them aes- 


thetically Cunintentional set), while the other group was instructed to look 


for common features (intentional set). Both groups were later asked to 
f figures, and these defini- 


Write the defining characteristics of the class o! c 
tions were scored for the number of correct common features mentioned. 

Presumably, whatever concept learning occurred under the unintentional 
set would be a summation of positive and negative associations, whereas 
the intentional set would promote active search and a more efficient concept 
achievement. The other important variable was the variety of instances. 
ome subjects received only 2 examples while others received 12. The 
expectation was that the unintentional set would be sufficient for the small- 
Variety condition when only 2 figures were seen repeatedly, and that the 
intentional set would be better for the larger variety of instances, because 
the subjects would be trying to formulate and test hypotheses to account for 
all instances, The results, as shown in Table 2.1, confirmed both expecta- 
tions, at least for these intricate figures. The possibility of concept learning 
Y two processes, long suspected, thus received strong support from these 
data on definitions. 
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TABLE 2.1. MEAN NUMBER OF COMMON FEATURES 
MENTIONED IN DEFINITIONS ACCORDING TO 
INSTRUCTIONAL SET AND VARIETY OF FIGURES 


Instructional Variety of Figures 
Set Large Small 
Unintentional 72 11.87 
Intentional 17.37 10.37 


From Podell (1958). 


SERIAL EXPOSURE 
If we treat concept attainment as a problem to be solved rather than as 
the learning of responses, the main independent variable need not be 
practice. Another method for studying concept attainment processes requires 
the subject to select a common dimension in a displ 


ay of figures and then 
to test his selection b 


y applying it immediately to a second display of figures 
(Johnson, 1961). For the serial analysis of problem-solving activities, the 
apparatus is so arranged that the subject can illuminate cither display but 
not both at once. Thus hypothesis formulation is separated from hypothesis 
testing. Fig. 2.3 shows a problem that is ambiguous since the common 
dimension of the figure on the left side m 
jects who select either will find a s 
Subjects who had seen several displ 
shape selected the shape solution 
display of figures on the right cont 
had selected the shape dimension 


ay be shape or texture, and sub- 
atisfactory solution on the right side. 
ays of figures that were easily classified by 
to all ambiguous problems. When the 
ained only a texture solution, those who 
could not find an appropriate solution. 


FIGURE 2.3. Figure concept problem. Instructions: “Examine the figures on the 
left and note what they have in common. Then look at the figures on the right and 
find another of the same class.” From Johnson (1961). 
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They might have perceived and remembered both dimensions but apparently 
they did not—all the subjects had to switch back to the left side to observe 
the other dimensions, then return to the right side and identify the correct 
figure. Thus this experiment emphasized the discontinuities in solving a 
concept problem by allowing the subject to interrupt his own activity anál 
seek more information when he saw that he had selected the wrong di- 


mension. 


ATTEMPTS TO DEFINE THE CLASS 

Many ordinary citizens can classify oak trees and maple trees but can- 
not describe the identifying features of each. Laboratory reports have also 
noted, since the beginning of concept research, that many subjects after 
learning to classify objects cannot define the class. Subtle theoretical im- 
plications have been elaborated around this common occurrence, but an in- 


terpretation that takes account of the methodology of concept experiments 


is more convincing. 

When the classification test anc 
nate measures of achievement of the concept, any expectation of consistency 
between the two assumes that both are adequate measures of the achievement. 
But typically the defining task is introduced at the end of the experiment, 
with no practice, while the classifving task is practiced from the beginning. 
definitions are hard to write at best, and the complex 
early experiments were patterns of almost in- 
definitions presents a methodological 


1 the definition test are treated as coordi- 


Furthermore, good 
figures used in some of the 
describable shape. The scoring of the 


problem that has usually been overlooked. 
when the attributes are familiar shapes and 


In many recent experiments, 
If the attributes to be considered are ex- 


colors, definition is much easier. 
plicitly denoted in advance, then the universe of discourse is restricted to 
of combination suffices as a definition of 


them and a statement of the rule 
at the class consists of red triangles, 


the class, If the subject states th 
triangles and red need not be further defined. In such experiments, an 


attempt to formulate a rule of combination of the attributes, either to com- 
municate to the experimenter as a measure of achievement or to retain 
in subsequent trials, amounts to an attempt at 
In attempting to describe the 


learn in just classifying the 


as a hypothesis for testing 
a definition. Are such attempts worthwhile? 
class, do the subjects learn more than they 


instances? 
One experiment on the effectiveness of practice in defining required 


children of about twelve years old to learn the concept of gunkle bird 
(Johnson and O'Reilly, 1964). Pictures of birds on cards had red, yellow, 
or blue wings, orange, green, oF black tails, and short, long, or hooked 
beaks; there were 27 cards in all, with black tail as the defining feature. 
A parallel set of 27 cards had verbal phrases, such as red wing and orange 
tail, in the same combinations. Each child in 1 group was given the pictorial 
cards, l at a time, asked to guess whether it was a gunkle bird or a bunkle 
bird, and told when he was right. After attaining the correct classification, 
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each subject was asked the defining question: “How do you think you 
can tell a gunkle bird from a bunkle bird?” Another group practiced clas- 
sifying the verbal cards by the same procedure. A third practiced on the 
pictorial cards and, in addition, was asked after each 5 trials to attempt to 
state how to tell a gunkle bird from a bunkle bird. The results demonstrated 
that this small amount of practice in defining, even without knowledge of 
results, increased the quality of the definitions. 

The explicit stating of a hypothesis is good for college students, too. In 
the typical concept experiment with college students, the subject states a 
hypothesis whenever he thinks he is beginning to catch on; but when he 
is required to state a hypothesis systematically after each trial, he is likely 
to achieve the concept more quickly (Byers and Davidson, 1967). If the 
hypothesis that is offered is called wrong, however, the subject hesitates 
before offering another (Byers, 1965; Wallace, 1964). Disconfirmation of 
a hypothesis has an aversive effect. After some discouragement of this kind, 
most subjects tend to offer a hypothesis only if it is firmly established. 


HYPOTHESIS-SAMPLING MODELS 


When concept achievement is treated as the formulation and testing of 
hypotheses, it is possible, with certain simplifying assumptions, to describe 
the subject's activities in mathematical terms. Mathematical models based 
on hypothesis testing, which are quite different from those based on asso- 
ciative learning, have been rather successful in accounting for the data of 
me learning experiments and have made a strong impression on the 
Syc. g izi i 
ps ologica theorizing of the past decade (Bower and Trabasso, 1964: 

a 1970; Levine, 1970; Restle, 1962; Suppes, 1965), 

aie APRA Te . à ; 
P e nee employ a variety of mathematical manipulations which need 
y R 
à consi ered here. But the assumptions about psychological processes 
eserve our consideration beca he s f a idity 
a secon ause the success of a model supports the validity 
mpti ich it i i 
is Aik ra i which it is constructed. In general, a model of this 
bs leg SAh rat the subject samples the attributes or cues that he per 
cng yh he sn ea until he finds a cue, or a combination of 
> 1s consistently i i i i 
wieder 2 y confined as correct. Since he is actively trying tO 
es ii 
a veo a = “reste of a cue amounts to a hypothesis that the 
aa Pi made a sy models are called cue-sampling or hypothesis- 
s k n A á ii a s . 
ah eenry para i hypothesis is confirmed, he continues 
. : 
pl ith oa ge o correct responses remains high. Until this 
ind alled presolution trials, he shifts f 
sam ping wilh cecil , ts from one cue to another, 
g mpling at random and, if the ial h 
Ea enn ih materials are complex, he 
er anything useful about the instances he has seen. There- 
fore, according to the model, the subject makes no progress until La fannu- 
lates the correct hypothesis. An analysis of th a : 
and S ) e responses will show correct 
responses and errors with chance frequency up to the last Tk as Ae 
ee s : e last error. It is n 
alwavs necessary to identify the subject’s specific hypothesi ; T fined 
3 y esis; in well-denne 
sets of figures, one wrong hypothesis leads to the Se p i N r z 
f N pgr: € roportion of errors 
as anv other wrong hypothesis. However, the Spine eos infer what 


ACHIEVEMENT OF FIGURE CLASSES w 59 


acting on by skipping the usual informative feed- 


hypothesis the subject is 
responses that are neither confirmed or discon- 


back and observing a few 
firmed. 

Whereas associative theories portray progress in concept learning as a 
continuous strengthening of the reinforced responses, hypothesis-sampling 
models portray it as a discontinuous, all-or-none phenomenon. Obviously, 
this model requires analysis of the responses of each subject individually 
because the last error is made by different subjects on different trials, and 


group data show a continuous improvement which obscures the individual 


discontinuities. 

Bower and Trabasso ( 
in data from college stude 
mension, among several irrelev 


1964) found support for the all-or-none assumption 
nts learning a class defined by one binary di- 
ant dimensions. For any one subject the 
proportion of correct responses prior to the last error was no greater follow- 
llowing an error. This proportion, actually, 


remained constant at the chance level during the presolution trials. Various 
patterns of multiple-reversal and dimensional shifts in what the experi- 
menter called correct did not influence the error rate prior to solution 


(Trabasso and Bower, 1964). Several studies of children learning mathemati- 
all-or-none assumption CSuppes, 1966). 

The difficulties that have arisen to plague models of this type involve some 
above, especially the question of when and how 
theses that he tests. It is not certain that a 


subject remembers nothing from hypotheses he has tried and njai Se 
basso and Senden 1968). Apparently, what is remembered ge s 
partly on the time allowed the subject between trials to think pou t oni 
formation just received (Bourne, Dodd, Guy, and Justesen, me “in 
hypothesis sampling models assume sampling without ia oe a bari 
Some conditions the subject will resample a hypothesis he has pa cir r A d 
and rejected (Erickson and Zajkowski, 1967), and he may odd <a 
Pothesis after a response has been confirmed as correct (Do = > r "i 
1969). Under some conditions the alert subject will fogun an 7 

ypotheses concurrrently or he may adopt a working hypot o me 
Sponses but also monitor other hypotheses of interest na or ieee 
the focusing strateov and the scanning strategy are not so sharp y i s >r 5 
as they semed t be ten vears ago- Furthermore, multidimensiona ; - 5d a 
may be acquired in two or more stages, which has the anna of decre 
ing the difference between continuous and discontinuous mode S. 


The hypothesis-sampling models have already dejane rai ve 
f i aulation and testing © ypotheses and as 


as explici i e forn 

eade arra p a to the associative approach, although, 
when the situation is complex, the possibility of associative learning at some 
Stage cannot be overlooked (Overstreet and Dunham, 1969). This is an 
active, intriguing area of research at presents futile ae was on yoa 
sampling assumptions will no doubt be discovered and more comp icated 


models elaborated to meet the criticisms: 


ing a single correct guess than fo 


cal concepts also support the 


of the assumptions mentioned 
the subject formulates the hypo 
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COMPLEX CLASSES AND RULES OF COMBINATION 


The difficulty of acquiring classes increases, as we have seen, as the number 
of relevant and irrelevant attributes increases. But the difficulty is determined 
also by the structure of the class, that is, by the way the relevant attributes are 
combined. For a systematic comparison of the difficulty of different types of 
concepts, Hunt and Hovland (1960) arranged an experiment with stimulus 
materials that varied in color, shape, and number in such a way that each 
subject could organize the instances in a conjunctive, relational, or disjunctive 
class with equal validity. Each subject received first a training series that 
familiarized him with one example of each type of class, and then a test series. 
The test series could be classified in any of the three ways, but it was found 
that conjunctive and relational solutions were used much more frequently 
than disjunctive ones. It is possible, of course, to arrange special training in 
the use of disjunctive classes; this increases the proportion of subjects who 
offer a disjunctive concept when a conjunctive concept is also available and 
it even increases the number of disjunctive rules given for forming the classes 
CWells, 1963). The training does not render disjunctive classes as casy as 
conjunctive, however. f 


COMPLEXITY AND DIFFICULTY 
Actually, there are many ways in which a set of objects can be classified, 
some of which seem to be more complex than others. To evaluate the effects 
of complexity systematically, Shepard, Hovland, and Jenkins (1961) used 
several sets of eight stimuli—including pictures of familiar objects as well as 
meaningless designs—arranged in six levels of complexity. The least complex 
were unidimensional classes; next came classes with two dimensions. Classes 
with three dimensions increased in complexity as disjunctions and exceptions 
ae added. The experimental results showed that the classes which the 
a Ween considered complex were, in fact, learned more slowly. Neisser 
Sela kramm Ae z k pene m bs bes 
bination. These sna RE ith i complexity of the rule o i 
appear when a com polntediout that the order of difficulty does no! 
puter program is used to attain the concepts by simple 


elimination. “It i 
r seems to reflect a hierarchical organization of conceptual 
processes in the Ss themselves [p. 645].” l 


But if a class ; 
a class appez ? 5 : 
Ppears complex to the experimenter, does it also appear 


complex to the subj j 

: Jects who have to learn it? An enlightening outcome 0 
the Shepard, Hovland, and Jenki ; ba i : 
subjects write rules fo ins experiment was obtained by having the 
a close correlation faa ee the classes. It turned out that there was 
ammpleo a ale wa “a complexity of rules and difficulty of acquisition. An 
$ a a ihe telt: ei i for the construction of a simple class was: “All 
ja ass “All Visal he es on the right.” A rule written for a more complex 
age r Aa emery the left and white figures on the right except 
the large black circle should be exchanged with the small white triangle 


ue 
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[p. 15].” A correlation of .90 showed that those classes which were difficult 


to learn, in terms of time and errors, were the ones for which the subjects 


wrote complex rules of classification. 


ATTRIBUTE LEARNING AND RULE LEARNING 

If the class is unidimensional, defined by one attribute, that is all there is 
to be learned; but if the class is multidimensional, the subject must learn 
for combining them. These two aspects 
ap under ordinary conditions, 


the relevant attributes and also the rule 
of the conceptual task may alternate or overl 
but Haygood and Bourne (1965) have shown that they can be separated 
quite easily. Attribute learning can be studied by telling the subjects the rule 
but not the relevant attributes—as was done 
viewed above. Rule learning can be studied by 
attributes but not the rule of combination. 


When college students learned several typ 
this analytic procedure showed that at- 


and contributed more errors than rule 
asses consisted mostly of learning 


in most of the experiments re- 
telling the subjects the relevant 


es of classes, constructed from 


the conventional geometric figures, 
tribute learning required more time 
learning. The acquisition of conjunctive cl 


the attributes; learning the simple conjuncti 
a negligible contribution to time and errors. For example, those subjects who 


had to learn the attributes made an average of 4.0 errors before mastering 
the first problem, while those who had to learn the rule made 0.2 errors, and 
those who had to learn both made 3.6 errors. Rule learning was a more 
important part of the acquisition of disjunctive and other complex classes 
because, although the attributes were the same, the combinatory rules were 
more complex, ‘On disjunctive classes, for example, those who had to learn 
the attributes made 12.1 errors, those who had to learn the rule made 8.0 
errors, and those who had the complete problem made 37.0 errors. Adolescents, 
in grades six and seven, are also capable of learning rules for conjunctive and 
disjunctive classes and of transferring their skill to new problems CYouniss 


and Furth, 1967). 

When the inquiry moves from 
many complex cases taken from syml 
conditional rule is particularly interesting : : 
A, then B. Su ppose the relevant attributes are square and red, with other 

I and the rule is: If square, then red. Square red 


at is, they fit the rule, and square figures that 
are not red are obviously negative. ie ele: dogs not: imply that figures that 
are not square must be red, so red figures of other shapes are posee SAGE 
and so are figures that are neither red nor square. The latter two instances are 


the ones that make the conditional rule a difficult one to learn. In the Hay- 
Sood and Bourne study, for example, those who had to learn the attributes of 


4 conditional problem made 26.8 errors 9% the average, those who had to learn 
the rule ma 16.4, and those who had the complete problem made 38.4 


errors, 


ve rule was so easy that it made 


attributes to rules for combining attributes, 
bolic logic can be investigated. The 


because it involves implication: If 


oe being irrelevant, 
gures are obviously positive, th 
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INSTRUCTIONS AND TRAINING 


Concept research has usually followed the tradition in experimental psv- 
keepi j i iti i > > duration 
chology of keeping the subject in a condition of innocence for the d 
of the experiment. In educational circles this condition is supposed to promote 
“learning by discovery.” But in fact when the subject is given detailed knowl- 
edge of the materials, the task, and the concept to be attained, his perform- 
ance is generally improved—though the reasons for the improvement may be 
obscure. Simply giving more explicit instructions and a brief demonstration 
improves performance even on the difficult conditional rule CLaughlin, 1969). 
These conditions can be experimentally manipulated, like other experimental 
conditions, to clarify the reasons for the improvement. 


PRETRAINING 

Much of the difficulty of the more complex cl 
(1967), is due to the inexperience of the subjects in making use of the in- 
formation provided. Teachers of logic have worked out several schemes for 
systematic representation of the information to be used in deductive reasoning; 
one such scheme, the truth table, can be used also in inductive reasoning, or 
concept and rule learning. Bourne gave some college students preliminary 
training by having them sort geometrical figures into two categories, positive 
and negative, on the basis of presence or absence of relevant attributes. 
Students who had pretraining on some simple problems developed strategies 
that helped them with more complex problems. In one study, for example, 
the trained group required only about a third as many trials as a comparable 
untrained group in order to attain the more complex rules of combination. 

We have seen that the diffculty in using negative versus positive informa- 
tion decreases with practice, and so does the difficulty of disjunctive versus 
conjunctive classes. Practice in mapping the available information into a 
truth table Cor into any systematic scheme for representing the dat: 
the logical use of the data and reduces the vulnerability 
one kind or another. If a subject represents all instances in 
for example, he is more likely to consider it a cl 
use the negative information along with the positive. For pretraining, Guy 
(1969) had children of seven and nine sort figures into a box with four 
panels, two by two, labeled according to the presence or absence of the positive 
values on two dimensions. Then the children were shown how to collapse the 
two-by-two table into just two categories, positive and negative. The pretrain- 
ing did reduce the differences between the difficulties of various rules of 
combination—for children as well as college students. Some aspects of the 
Problem became mechanical, so that the subjects could apply their conceptual 
abilities more efficiently to all types of problems. If we regard the following 
of a complex rule as a high-level task and the coding of figures into classes as 
a low-level component of this task, then practice in coding is a helpful first 
step toward the higher performance (Lee, 1968). 


The st i inati ini i 
most illuminating effect of pretraining, however, with a revealing 


asses, according to Bourne 


a) simplifies 
to interferences of 
a two-by-two table, 
osed system and be willing to 
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sidelight on concept learning theories, is the effect on the processes by which 
the concept problems are solved. It appears from an experiment by White 
(1967) that the activities of college students will be consistent with either an 
associational theory or a hypothesis-testing theory, depending on the pre- 
training they have received. White gave some students pretraining on an 
gures, assuming that these would be learned by a gradual 
h the appropriate responses. He gave other 
students a systematic set of figures, assuming that these would encourage 
students to look for the system and thus employ a discontinuous, hypothesis- 
sblem, which was the same for all subjects, 
ls on either systematic or unsystematic ma- 
ay. But 75 pretraining trials separated the 2 
groups of subjects on the test problem. Those with pretraining on unsystematic 
materials acquired the second concept gradually, as if by associative learning, 
while those with systematic materials acquired it in a discontinuous fashion, 
as if by hypothesis formulation and testing. As the author put it, “Hypothesis 
testing theories describe the behavior of sophisticated Ss, while associational 
theories have more applicability to the untrained S [p. 13].” 


unsystematic set of fi 
increase in correct associations wit 


testing approach. On the test pro 
subjects with 25 pretraining tria 
terials performed about the same w 


VERBAL LABELS À 
The concept learning experiment can be modi 
either to pursue the theoretical questions discussed above or to evaluate 
alternative classroom practices, by the technique called matching to a sample. 


Wittrock and Keislar (1965) projected slides with three figures varying in 


four dimensions: color, size, shape, and number. The sample figure or model 
appeared at the top of a slide, the other two figures at the bottom; the child's 
task was to choose the bottom figure that was most like the top figure. This 
is, in effect, a reception method of presentation, with all three instances shown 
Simultaneously, The top figure, for example, might be three small green 
Circles while the bottom left figure is two small green diamonds and the bot- 
tom right three large red circles. When color is the correct dimension, and ar 
shape, and number are irrelevant, the correct choice for this example wou 
be left. 

Wittrock and Keislar examined the effects of verbal hints or cues, given 
orally when the slides were shown, on helping children discover concepts and 
transfer their skill to mew concepts. For children in the second and third 
grades, general hints about concepts were useless but class cues, such as color, 
and specific cues, such as red, facilitated learning on transfer to new concepts 
as well as to additional instances of the same concept, and even on a follow- 
up test three weeks later. One might expect that general hints about concepts 
would be most helpful in the transfer to new concepts, but there was some 
evidence that the general hints merely confused the children. 

Similar slides, with simple line drawings of familiar objects such as trees, 
birds, and toys, were prepared by Stern (1965) for children in kindergarten 
and first grade. The top picture might be a bird, for example, and the three 
bottom pictures a toy, a tree, and a different bird. The children’s task was 


fied for young children, 
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to select one of the bottom pictures to “go with” the model above, and in this 
experiment the children were given labels for the instances, for example, 
robin. It was not a case of learning the labels as such; it was more a matter 
of using them conceptually, to apply to a class and thus to solve a problem. 
The same superiority of class names as verbal cues has been reported in an 
experiment on teaching French words for pictures to kindergarten children 
CWittrock, Keislar, and Stern, 1964). 

When complex rules are to be learned, one efficient procedure is to learn 
the components separately. The complex rule that Lee and Gagné (1969) 
taught to high-school students was the relation between certain A attributes, 
such as size and color, and certain C attributes, such as shape and texture. 
It is reasonable to suppose that the A-C tule is acquired in two stages: 
categorizing the A attributes and associating them with appropriate labels 
(the B components), then learning the B-C rule that relates the labels to the 
C attributes. A subject might learn, for example, “When a red large figure, 
choose the shape of the figure and read the value of the shape dimension.” 
The experimenters assumed that the B labels acted as verbal mediators in the 
chain between the A and C attributes. They supported this assumption by 
demonstrating that consistent training with the mediators facilitated acquisition 
of the A-C rule more than inconsistent training. The implication is that a 
concept is more likely to be integrated with others when it is properly labeled. 

These experiments leave no doubt about the generally helpful effects of 
verbal labels when appropriately applied to classes of figures. There are ex- 
ceptions, of course. Furth and Milgram (1965) reported that asking children 
to supply verbal labels for pictures facilitated correct classification at age nine 
but not at age six when verbal skill was less developed. And the controversy 
over mediated association and reversal shifts sketched in the preceding pages 
suggests that even when labels have proved helpful, there is room for argu- 


ment as to whether these effects are due to verbal mediation or to verbal 
control of attention. 


STRATEGY TRAINING 
Children’s problem-solvin 


8 processes also can be i red | train- 
; NEP 8 P an be improved by proper 
ing which is directly sugg l : 


y ested by the theoretical investigations of concept 

learning. One erro that child m 

i e r that children, as well as adults, often make is to repeat a 

E esis they have already tried. In technical terms, they sample cues by 
placement strategy when a nonreplacement strategy would be appropriate. 


Wittrock (1967) was able to train second-grade children, 12 minutes a day 
for thirteen days, ‘ 


probes wich on a nonreplacement strategy for simple figure concept 
with a with good success and with transfer to other problems. Children 
In 
Ng on a replacement Strategy were more successful than a control 
group, but not as su 


ccessful as those with nonreplacement training. 
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but no conclusive agreement. The more satisfying progress consists of the 
description of cognitive processes, such as hypothesis formulation and testing, 
focusing strategies, nonreplacement strategies, systematic representation of the 
information, and verbal labeling. Any comprehensive account of concept 
acquisition must include the alternatives and the flexibility: thus, verbal labels 
are useful but not always necessary; hypotheses are helpful, especially for 
sophisticated subjects, but others get by without them. A pluralistic description 
is more satisfying when one recalls that the conditions under which each of 
these activities occurs have been partially determined and so have their con- 
tributions to improvement in overall performance. Clearly, a whole theory 
of concept learning will be more than the sum of its interchangeable parts. 


LEARNING MEANINGS OF CONCRETE WORDS 

THROUGH INFORMAL COMMUNICATION 

The experiments reviewed above, analyzing complex abstract human ac- 
complishments, are a triumph of psychological methodology. They have been 
criticized, however, as experiments on artificial classes associated with artificial 
names. The advantages of such materials are obvious to an experimental psy- 
chologist, but the range of application of the results must be limited. Since the 
concepts that everyone acquires and uses are the standard concepts of the 
culture, such acquisition may be considered, at a gross level of analysis, as 
one aspect of the lifesize enterprise of learning the culture. If the aim is to 
investigate this extensive and tangled learning process, then culture-free 
concepts peculiarly suitable for research are no more to be hoped for than 
culture-free intelligence tests. 

And if the experiments described above are intended to yield generaliza- 
tions that apply to concept learning as it takes place outside the laboratory, 
they are open to another criticism. They have been designed, almost exclu- 
sively, as experiments in learning to classify. Hence the subject observes some 
objects and learns to give the class name as a response to the positive instances, 
and later a definition or rule of combination. Concept learning proceeds in 
this direction, to be sure, but it also proceeds the other way. The subject hears 
the class name or definition and then observes the objects to which it refers. 
Concept learning goes from words to things as well as from things to words. 

Suppose a man brings home a dax and shows it to his wife and five-year-old 
daughter, Later, he asks his daughter to get the new dax to show a visitor. If 
the child does so, she demonstrates that she has learned something about the 
meaning of dax, the extensional meaning at least. It is what her father refers 


to when he says dax. We need not assume that she likes or dislikes the dax 
£ ax. The reinforcement is contingent not on 


s response to her when she attempts to 
him the object that he calls a dax, she 


will receive social reward; if she brings him something else, she will be disap- 
Pointed. If she is rewarded in this situation, she may later be heard telling 


others that her father brought home a new dax. Reinforcement and nonre- 
inforcement of her attempts to use the new word in other situations also reside 


or makes any response to it as a d 
her response to it but on her father’ 
follow his instructions. If she brings 
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in the communicative process. In general, it is meanings, information about 
referents, that are communicated. E 

The meaning of a singular term is probably learned by simple asceiaban 
between the term, such as Daddy, and a perceptible referent. But even this 
is a polymorphous association: the child hears the word and then sces the 
referent of the word, he sees the referent and then hears the word, he sees 
the referent and speaks the word, he speaks the word and observes a rewarding 
response by the referent. 

The referents of a concrete general term, such as toy, are probably learned 
most often by extension or denotation, as when parents point to toys and 
tell the child to pick up “this toy,” and “this one too,” and “all the toys.” But 
intensional meanings or properties of the class are communicated also: “This 
is medicine. You have to take it to get well.” Children also hear many ad- 
jectives that refer to dimensions, such as heavy, red, and large, and soon use 
them with fair accuracy. These are the familiar dimensions from which 
synthetic classes are later constructed for laboratory research. 

Certainly positive instances occur more frequently in communication be- 
tween the parent and the child, but the occasional negative instances may be 
especially valuable. It is not uncommon for a parent to tell a child: “This is 
nota toy. It’s a sharp knife that can cut you.” Word usage is sharpened through 
continued two-way communication with more positive and negative in- 
stances, through deliberate correction by parents, and perhaps through 
tidicule by other children. The referents of so 
others are narrowed, Some singular concepts become general concepts, as when 
the child explores the neighborhood and finds that other children have daddies 
too. In all probability, adjectives that adults use to refer to bipolar dimensions, 


such as clean and dirty, are used as class concepts by children at first and only 
later as dimensional concepts. 


A well-known theor 


me terms are broadened while 


y of the acquisition of word meanings by the preschool 
child is the mediation theory of Osgood (1957), which begins with an un- 
learned or previously learned stimulus-response association, The taste and feel 
of warm milk in the mouth of a hungry 
sociated with swallowing, salivating, and digestive activities, Considering the 
stimulus pattern as the significate, Osgood states that when another stimulus 
is associated with this significate and the association is reinforced, the new 
stimulus acquires an increment of association with some fractional por 
tion of the total behavior elicited by the significate. Thus a visual stimulus, a 
bottle, can become a sign of the taste and feel of the milk because it can 
elicit a portion of the response to the milk. Likewise an auditory stimulus, the 


it elicits part of the manipulatory 
association generalizes to other balls 
ilar response patterns. The mediator 


bol is a fraction of the response to them. 
But words and other symbols, unlike signs, are produced as well as heard. 


After the child has acquired the skill to produce words like those he hears, the 


spoken word as well as the heard word is associated with the fractional re- 
sponse that represents the object or class of objects. 


infant, for example, are strongly as- 
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Since the fractional representational response has not been tied to distinc- 
tive experimental operations, discussion of this theory has been on the spec- 
ulative level. Lennenberg (1962) has pointed out that any theory of verbal 
comprehension that depends on speech movements is questionable because of 
the existence of children who learn to comprehend quite well without being 
able to talk. Fodor (1965), a linguist, is critical of psychologists’ theories that 
conceive of words or verbal cues acting as implicit stimuli and responses, 
because a word is not really a response in the sense that it is under stimulus 
control, Different words, synonyms, have approximately the same meaning, 
and the same word in different contexts has different meanings. Some words, 
such as dragon, have no corresponding stimuli. Thus fractional components 
of the response to a word as such are too unstable to support a theory of 
above of the girl learning the meaning of dax, it was 


meaning; in the example 
successful communication that was reinforced, not any particular response 


Calso see Osgood [1966]). There is general agreement that any theory of the 
A meanings must include the flexibility between 
words and referents in diferent contexts; the disagreement is whether a 
mediation theory of the S-R type, or any other, can be flexible enough to ac- 
count for the facts without being too abstract to be testable. But psycholinguis- 
tic research, inquiring about sentences as well as words, is aa ki 
amount and sophistication, hence it is not too maeh ta iope 2 hat we sha 
gain a deeper understanding of the acquisition of word mieanings: ; - 
At any rate, by the time he goes to school and enters a more formal edu- 
f = as learned to use many common terms to 


cation program, the average child h ei 
refer to classes and dimensions and to understand these terms as n 
such abstractions when others use them. It is a fair guess that some of the 


results of laboratory research on the learning of figure ames as n 
above, are applicable to this extralaboratory preschool learning, but the range 

ji a a : z š agë A aao 
of applicability is not known at present. At this young age gradual associative 
> ait nt, intuitively, than hypothesis formation and 


relation between words and 1 


learning seems more pertine 
feen 
esting. 


Achievement of Word Classes and 
Meanings from Other Words . EORNA 
classes is more typical of man’s distinctive intel- 
n the acquisition of figure classes: Word casses 
figure classes sg one e a is itself j pi 
fig -nension. But verbal and pictorial stimuli 
that refers to an abstract ae that what we know het figure classes 
a classes. In the Johnson and O'Reilly (1964) 
oe 57, children who had verbal phrases describ- 
g ver trials and gave better definitions 
orial dimensions. Compared to the 
analyze and to describe in verbal 


The acquisition of word 
lectual accomplishments tha 
seem to be more abstract than 


differ in many ways; We can 
can be directly applied to W 
experiment mentioned on pê e 
ing birds learned to classify them in fev : 
than children who had corresponding pe 
pictures, the verbal materials are easier to 
terms. 
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are simple classes, characterized by one element, and that this one ae 
characterizes the class because of its meaning, not its physical attributes. The 
subjects of Reed’s experiments, who were college students, knew the mean- 
ings of the critical words as well as the noncritical ones. The task was not v 
learn word meanings but to abstract the one critical word out of each group 0 
four and to generalize across four-word groups. 


THE PREPARATORY SET ; 

In learning word classes, as in learning figure classes, the set or preparation 
for the task, established usually by the instructions, makes an important con- 
tribution to efficiency. If a subject does not know that there are classes of 
words, he may simply associate one word with another. This may, in fact, 
be the most efficient procedure if the classes are hard to learn and there are 
only a few instances in each class. When studying French, for example, it is 
worthwhile learning the classes of regular verbs because there are only a few 
classes and many verbs in each; but there is also a small number of irregular 
verbs that must be separately memorized. The materials of the usual concept 
experiment are chosen so that acquisition of the classes facilitates performance. 
Under such conditions, a set to learn classes is more efficient than a set to 
learn correct associations and, unlike associations between nonsense syllables, 
these conceptual achievements are retained very well (Reed, 1946). 


METHOD OF PRESENTATION 

Simultaneous presentation has certain obvious adv 
Common serial presentation—for word classes as for figure classes. In a com- 
parison by Reed (1946), the simultaneous method led to the discovery of 
more correct concepts, presumably because when the subjects can study 
several examples of a class at once in their own way, they can easily detect 
similarities and check hypotheses. This advantage ‘did not hold, however, 
when so much material was displayed that the subject could not grasp it sim- 
ultaneously and had to fall back to a serial method. Furthermore, the subjects 


spent considerable time studying the words presented simultaneously, hence 
this method increased accuracy rather than speed. 


antages over the more 


ABSTRACTNESS 


The old question of the abstractness of concepts can be examined with 
word classes as with figure classes, but the 


s 2 materials for the former are perhaps 
easier to standardize. In an exper; 


high-frequency words equated for 
stract words, second, minute, week, 
ped under the label paf, and, as an 
alad, cereal, sugar, and cream were us 
bstract classes were clearly more diffi- 


cult for college students to learn, judged by maibe iff failures promptings 


required, and generalization errors. 


This finding is consistent with the recent research on verbal learning, 
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summarized by Paivio (1969), showing that concrete words are more easily 
learned than abstract words. Paivio's interpretation, that concrete words 
arouse more facilitative imagery than abstract words, is also pertinent to the 


learning of word classes. 


RELEVANT AND IRRELEVANT WORDS 

Irrelevant words interfere with the formation of word classes just as 
irrelevant dimensions interfere with the formation of figure classes. This 
interference has been demonstrated by a serial-exposure technique (Johnson 
and Hall, 1961) like that described earlier for figure classes. The subject 
is presented with a list of words, with instructions to observe what they 
have in common, then turn to a second list and find another word of the 
same class. Attaching a label to the class is not required. An illustrative 


problem follows: 


bark 1. birch 
salt 2. number 
pepper 3. wood 
cement 4. pot 
paprika 5. cloves 
thyme 


The important measurement, for present purposes, is the time spent on the 
first list. The number of irrelevant words contributes far more to the time 
required than the number of relevant words; problems with six relevant 
words, for example, required a median time of 3.3 seconds. With five rele- 
vant and one irrelevant words, median time was 3.6 seconds. In problems 
like the one given here, with four relevant and two irrelevant words, 6.3 
seconds was required. In general, the interfering effect of the irrelevant 
words was less when the number of relevant words was large. 

This method yields measures of the difficulty of forming classes more 
efficiently than conventional learning methods, but the experiment is open 
to the criticism that the associations between words were not standardized. 
An experiment of similar design by Mayzner (1962), but with better control 
of the materials, presented two, three, four, or five words in the F list, “i 
addition to the correct word. These problems were more difficult than the 
one above. Their solution was aided by an increase in the first list, which 
offered more associations with the correct response word, and impeded by 
an increase in the second list, which offered more competing associations. 


SIMILARITY OF MEANING a : 
previous section on salience and similarity in te EE si E 
classes noted that the dimension on which objects dilter 1s more obvious and 


therefore more useful as a cue to classification when p seine on that 
dimension are large. It is reasonable to ask if this piip e applies also to the 
achievement of wail classes where the dimensions 0 oe rere, — 
Sions of meaning. Taylor and Haygood (1968) investg E ag: 


à - ratings on the evaluation dim mE 
Selecting words that were close together in Ta g en 
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ji iffered, 
sion for comparison with words that were well separated. The words differ l 
i i A saluat vir results 
of course, in other semantic dimensions irrelevant to evaluation. Their result 
i i i 55 ede re 
showed that learning to classify 60 words into 2 classes proceeded mo 
i ae i i slatively easy to 
rapidly when the separation in meaning was large. It is relatively easy 
i i i i eani 1 ifferences 
classify words for their evaluative dimension of meaning when the a i 
is di i ere ine sords clos 
on this dimension are large. The same results were obtained for wor 
together and far apart on the potency dimension. 


ic di i i ivity—are mni- 
Three semantic dimensions—evaluation, potency, and activity—are prot 


nent in children’s classifications of words also (Di Vesta and Ingersoll, 1969). 
The evaluative dimension is basically salient, of course, hence fifth-grade chil- 
dren learn word classes rapidly if this dimension is the relevant one. But 
whatever semantic dimension is relevant, the le 
differences on the relevant dimension 
irrelevant dimensions, 


arning goes faster when the 
are larger than the differences on the 


COMPLEX CLASSES AND RULES 

The first experiments on word classes constructed simple classes character- 
ized by one dimension of meaning, but the fact that words can be rated on 
several dimensions of meaning makes it possible to construct complex classes 
as well. Di Vesta and Walls (1969) constructed multidimensional classes 
from words rated on the evaluation, activity, and potency dimensions, pre- 
sented these to fifth-grade children to learn, and found, as expected, that 
disjunctive classes required more trials for learning than comparable con- 
junctive classes. The expected differences between attrib 
learning, which has been demo 
shown in Fig. 2.5, 

Thus it appears that several of the 
out with figure classes apply also to 
are often strongly associated with oth 
are, as far as we know, peculiar to w 


ute learning and rule 
as 
nstrated for figure classes, also appeared, a 


generalizations which have been worked 
word classes. But words, unlike figures, 
her words, and this raises questions that 
ord classes, 


WORD CLASSES AND WO 


Reed departed from tradition 
tradition in naming the 
word classes are also wo 


RD ASSOCIATIONS 


in constructing classes of words but ile 
by nonsense syllables. When the names © 
rds, the responses as well as the stimuli are words, 
Previously learned associations may be aroused—to 
mation. The relevant and irrelevant dimen- 

of figure classes may differ in salience, as 
noted above; but the differences between words in terms of previous associa- 
more specific. This special nature of — 
ardization of research materials is needed. 
with verbal learning research and generates 


classes 


fruitful new hypotheses for both 


DOMINANCE OF ASSOCIATIONS 
Underwood and Richardson (1956) 


i i : 7jous 
investigated the influence of previov 
relationships by presenting their subj 


jects 
ects four names of common objec 
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15 a 
140 
130 
a—A Complete learning 
120 o—O Attribute learning 
Oo—D Rule learning 


Mean trails to criterion 


Problems 


FIGURE 2.5. Attribute learning and rule learning of word classes by fifth-grade children. 


F s 
rom Di Vesta and Walls (1969). 


hat single dimension, represented 
tuation like this, when the 
e is likely to respond with 
the stimulus word in the past (e.g. 
he hypothesis that concept learning 
word has previously been strongly 


associated with all four stimulus words a see en words of 
ifferent strengths of association to stimulus words had to be collected first. 


long list: of “nouns: was presented to college students: they were instructed 
put with the restriction that the 


to giy rath C 
give a free association word to ca Ay i i 
ptive of sense impressions, such as 


ver W 
hem. In a sl 
pecific set, hi 


Ceg, tomato), with a request to disco 
aie adjective, described all four of t 
a od has nothing else to go by, no s 
red) ta has been associated with 
© the experimenters began with t 


will oce $ F na 5 
occur quickly if a descriptive response 
ds. Hence, 


associ we i 
Ociated words must be adjectives descri i w Jenei 
Tound, small, and white. From this list four wores were selected, such as 
» @ p a . p 
arrel, doughnut, knob, and balloon, to which the word round was an asso- 


ciate of high dominance. That is, round was the response word given first 


74 % COGNITION AND CONCEPTS 


TABLE 2.3. EXAMPLES OF ASSOCIATIONS OF HIGH, MEDIUM, 
AND LOW DOMINANCE 


High Dominance Medium Dominance Low Dominance 

barrel bread baseball 

doughnut d (66%) flannel ft (42%) fang shite (12%) 
knob rout jellyfish Soft (42% ) paste white (129 
balloon moccasin sugar 


From Underwood and Richardson (1956). 


with an average frequency of 66 percent. Examples of descriptive response 
words of high, medium, and low dominance are displayed in Table 2.3 in 
order to show how such experimental materials are prepared. 

For the main experiment, other students were given 6 such groups in 
various combinations and instructed to describe 4 nouns by 1 adjective, 4 
other nouns by another adjective, and so on. Correct responses were con- 
firmed, The subjects easily learned to group words like barrel and balloon 
together and to label them by a response associate of high dominance level. 
Learning word groups with a common response word of low dominance was 


much harder, so much so that many of them were not learned even in 20 
trials. 


In these experiments, the process is not so much a learning of new asso- 


ciations as a shuffling and reorganization of previously learned associations 
of varying strengths, Competing associations play an interfering role some- 
what as the irrelevant attributes in the learning of names for synthetic classes 
of geometric figures do, If baseball is to be put in the class named white but 


; the learning will consist of inhibiting the strong 
s initially weak, Underwood and 
ssociative response hierarchy by 
. The weak strength of the class 


o £ Sponses were most often class names 

of high dominance. p 

n wee oe r “eee was demonstrated more explicitly by 
r an arasi rape 2 

sen ; cis >, who assembled word groups in which the 

irrelevant words varied in dominance and 

to learn were those in which the irrelevant 

and formed a class of their own that was easi] 

class. 

Over and above the associative dominance of 
Freedman and Mednick (1958), is the Variance of this dominance. In order 
to examine these associations more closely, they made UP two types of word 
groups. The first consisted of groups of four words, from the lists of Under- 


y differentiated from the correct 


class names, according to 
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wood and Richardson, each word being associated with a common adjective 
at the same dominance level; the second of groups of words that had the same 
average dominance level but contained one word of high dominance and 
thus had more variance in associative dominance. (An example of each type 
of word group is shown in Table 2.4.) The results of their experiment, fol- 
lowing the method of Underwood and Richardson, came out as expected: 
with average associative dominance equated, word groups of high variance 
in dominance were learned faster, with fewer errors, than those of low 
variance. The most frequent response to one word in a high-variance group 
is soon confirmed as a correct class name, and this gives the subject a cue or 
focus that helps him find others to which this name can be applied. 

A later experiment (Mednick and Halpern, 1962) tested the same prin- 
ciple, stated more precisely in terms of associative rank. The adjective metallic 
is a first-rank associate to pail because it occurs with higher frequency (24%) 
than any other adjective; the adjective soft is associated with the same fre- 
quency (24%) to belly, but it has second rank in this associative hierarchy 
because round has first rank (43%). Word groups containing a word with 
a first-rank associate were easier to learn than word groups of the same 


mean dominance without one. 
ill detect an analogy between the helpful effect 


contributed by a high variance of associations in the learning ol word classes 
and the focusing strategy in the learning of figure classes. A in is 
more than an analogy is the question. There is a difference in the k eling 
of the two interpretations, to be sure, which reflects the difference etween 
ative traditions. The notion of focusing implies 
e in formulating a hypothesis and testing it; 
the notion of associative dominance suggests that it is the stimulus word 
which elicits the response word. The subjects of word association experiments 
are not forbidden to formulate hypotheses but neither are the experiments 
designed to place them on record. Hence the available data do not reveal 
whether the acquisition of word classes in these experiments is continuous 


or discontinuous. 


The perceptive reader w 


the cognitive and the associ 
that the subject takes the initiativ 


TION 
MEDIATED VERBAL ASSOCIA e achievement of figure classes was 


The function of verbal mediators in th ; 
discussed earlier; their function in the achievement of word classes is more 
arlier; 


ND LOW VARIANCE OF DOMINANCE 
TABLE 2.4 s OF HIGH AND 
5 Fe ASSOCIATION 


sauerkraut 24% 


4 . 
oats oy hospital z smelly (23%) 
stone small (24%) — a1 
Canary 5 D 


From Freedman and Mednick (1958). 
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obvious and less controversial. In the learning of word classes, the first word 
associate is considered a mediator if it is not the correct response but is a cue 
to which the correct response is associated. (If the correct response word is 
directly associated with the stimulus word, the hypothesis of mediation is un- 
necessary.) The verbal mediator is usually conceived not as a complete re 
sponse word but as an implicit response having some of the properties of a 
word. Fortunately, there is considerable information on the properties of 
single words and of associations between words. 
used in designing the experiments on word cl 
in designing experiments on verbal mediation. 
on verbal learning is concerned with hypothetical associations and chains 
of associations between observable stimulus words and observable response 
words. The success of such hypothetical ex 
cation to the concept experiment. 

A recent experiment by Duncan (1965) 
college students learn which was the 


This information has been 
asses described above, and also 
In fact, much of the research 


planations has encouraged appli- 


illustrates this well. He had 
correct word on cach of 24 cards con- 
taining 3 words. The correct words were in pairs forming disjunctive classes: 
in one condition, for example, the correct cards contained either wait or 
give, the other words being irrelevant. This task w 
correct words were members of a familiar class, s 
and especially easy if the irrelevant words were not members of a familiar 
class. The mediator in this experiment could be cither member of a class, 
cirele or square, because such words are 
other and when one is reinforced, the 
also. 


as easy when the two 
uch as circle and square, 


commonly associated with cach 
occurrence of the other is strengthened 


Pairs of words vary more in associative strength than pairs of geometrical 
figures, hence the effects of verbal mediation suggest interesting hypotheses 
about concept shifts involving word classes. Kendler and Watson (1968) 
designed a concept shift experiment, like those of Kendler and others de- 
scribed in an earlier section but using classes of words as in the experiments 
Just mentioned. The important part of their results is that after the shift, the 
relearning was facilitated by high associative 
correct words and correct responses to the 
when the cues are words, it would not be 
is random. 


strength between previously 
new problems. In this situation, 
accurate to say that cue selection 


Similar effects of verbal mediation were den 
experiment by Schvaneveldt and Krol] (1968). If the subject has learned 
that groups of words containing the 


word horse are 
taining other animal words negative, the 


honstrated in a concept shift 


positive and those con- 


nin n a shift of reinforcement to groups 
of words containing cow would be an intradimensional shift, while a shift 


r mi w wi 4 i A . g 3. 
from milk to co would be an extradimensional shift. The results of figure 
concept research with adults suggests that intradimensional shifts would be 


lines, more so than figures, So 
ncluded in this experiment. A 
containing cow is facilitated by 


easier. But words are associated along many 
lists of high and low association were also ir 
shift from groups containing horse to groups 
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the high association between these two words, while a shift from rabbit to 
cow is not so facilitated. 

The results of this experiment indicated little difference between intra- 
s, but the difference due to the asso- 


dimensional and extradimensional shif 
ciative strength of the words was considerable. Shifts from one word to another 
strongly associated with it required about half as many trials as shifts to a 
weakly associated word. The authors take these results as evidence that con- 
cept shifts can be mediated by associations between the words and as a lack 
of evidence for second-order mediation by the dimension. Thus the fact 
that horse and cow as animals are on the same dimension, or in the same 
class, was evidently not as significant as the fact that cow is a close associate 
to horse but not to rabbit. 


LEARNING THE MEANINGS OF ABSTRACT TERMS 


The broad view of concepts proposed in this volume looks at a concept as 
a part of the general culture and at concept achievement as an aspect of 
socialization, It follows from the flexibility of the human mind and the versa- 
tility of concepts that concept learning will proceed along several routes. We 
have noted that young children can acquire a concept by beginning with per- 
ceived objects and learning the word that refers to them in ordinary domestic 
conversation or by beginning with the perceived word and learning what 
objects it refers to. But domestic communication refers to intangible classes 
and dimensions also. Children pick up such concepts as good, bad, gone, 
tomorrow, grow, love, and work from hearing these words in sentences with 
other words. This is usually called incidental rather than intentional learning. 

A child who hears an unfamiliar word in an interesting context may go to 


an adult or an older child and ask what the word means. When this happens, 


he may be given the denotative significance, through a few positive and 
accompanied by pointing. In the case of abstract 
of course, be described in words, and consider- 
ng is assumed. The child may even be given 
nding on the adult's skill and the child's 


negative instances, perhaps 
concepts the instances must, 
able previous conceptual learni 
a synonym or a definition, depe 


patience. 
] learn to read, their incidental learning 


When children go to school anc 
does not stop, but there is a big increase in their intentional learning, or 


study, a large share of which, at all educational levels, consists of the acquisi- 
tion of meanings for terms. The direct applicability of conventional concept 
learning research to concept teaching practice appears, however, to be quite 
limited (Carroll, 1964; Gagné, 1965) because the role of instructions from 
one person to another, in speech and in writing, has not received the study 
arch has been concerned with discovery learning, 
d, or “given,” to students by teachers and taser 
pts are acquired deductively as well as induc- 


it deserves. Concept rese: 
but concepts are also presente 
books. As Carroll put it, conce 
tively. 


An examination of the textbooks reveals that most writers employ several 
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techniques, though not systematically, in order to communicate a ape a 
cept. One common technique presents a definition first, then a è 
examples, and a negative example for differentiation; thereafter, the i 
is used in context to fixate it, Occasionally, a writer will introduce a new 
term in the context of a sentence, perhaps to arouse interest, before defining 
š S S ce a diffe i > se l and third 

The sequence of events does make a difference, in the second an ee 
grades at least. To demonstrate the superiority of a hypothetically optima 
; arene Eustace (1969) wrote a hierarchical program for teaching oa 
concept of noun, beginning with names of things and progressing to wes 
groups, a definition, position of nouns in sentences, and so on, through cig nt 
levels. This systematic order proved somewhat more efficient Gin terms of 
Scores on a posttest) than the same materials arranged in other orders. 

It is informative also to consider the case of the ordinary adult who wishes 
to classify the trees, rocks, or birds that he sees on a walk through the country- 


side. He has heard terms, such as oak 


X, sugar, maple, sandstone, and robin, 
and he has observed many colorful instances but, if he is serious, he will not 


try to attain concepts alone. He will get help, by oral communication from 
an expert if he is fortunate, or by written communication. Field manuals or 
guides to classification of birds, trees, minerals, and other natural phenomena 
are well worth psychological scrutiny, 

Such guides to classification usually begin by de 
features: the songs and wing structures of birds, th 
trees, and the color and specific gravity 
symbols are assigned to these 
reference. Then the classes—, 


scribing the distinguishing 
c leaf outlines and bark of 
of minerals, and so on. Standard 
perceptible features in order to facilitate verbal 


major divisions, subdivisions, and finer distinc- 
tions—are described on the basis of such attributes, and scientific names, 
together with local synonyms, associated with them. F 


eatures that are per- 
ceptually salient but irrelevant to the cl 


assification may be given special 
books do, that learning 
ation, facilitates learning 
eir names facilitates perception 
After study of the system, the rock- 
of any specimen will be mediated 
a term will depend 

» and he will communicate with 
other rockhounds in the te ; 


ACQUISITION OF MEANING 

In order to examine the d 
and Staats (1957) began w 
could acquire meaning by association with 
As the subject saw yof, he heard the word 
the word thief. In this way yof was associated with “good” words, such as 
beauty, sweet, honest, and smart, while xeh w 


as associated with “bad” words, 
such as thief, bitter, ugly, and sad. Other syllables and other scales «were ine 


BY VERBAL CONDITIONING 


evelopment of the meaning of a symbol, Staats 
ith nonsense syllab 


y lables, such as yof, to sce if they 
familiar w 


ords of known meaning. 
beauty. 


And he saw xeh, he heard 
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cluded in the experiment as controls. Later, when these subjects rated the 
syllables on the Pleasant-Unpleasant scale, yof was rated more pleasant than 
xeh. 

The same procedure was employed to give nonsense syllables more specific 
meaning (Staats, Staats, and Heard, 1961). Laj was associated with words 
that had previously been judged to be “angular,” such as square, box, roof, 
and triangle. Giw was associated with words that had been judged to be 
“round,” such as coil, globe, hub, and barrel. Later, when the subjects rated 
these svllables on several scales, laj was rated at the “angular” end of the 
Angular-Round scale and giw at the “round” end. 

Although these two experiments were designed to demonstrate classical 
conditioning of meaning, other interpretations are possible. When the adult 
subject’s awareness of the experimental contingencies and the demands of 
the social situation are scrutinized, it appears that this learning is frequently 
more than simple conditioning (Page, 1969; Staats, 1969). The criticism 
does not deny that children may learn meanings by simple conditioning in 
some circumstances; it is very difficult, however, to be sure that a college 
student learns by conditioning when he can diagnose the experimental ex- 


pectations and reach the same end in his own way. 


ACQUISITION OF MEANING FROM VERBAL CONTEXT 
Learning the meaning of a word from the context of the sentences in 
which it occurs is a common mode of acquisition; children acquire many 
ate this process, Werner and Kaplan (1950) 
| contexts for each. One task was to discover 


sentences such as the following: 


words in this way. To investig 
printed artificial words and severa 
the meaning of corplum from s 


A corplum may be used for support. 

Corplums may be used to close off an open place. 

A wet corplum does not burn [p. 3]. 

nted to children one at a time and each child 


artificial word that would fit all six sentences. 
were not yet capable of “de- 


The sentences were prese 
was asked for a meaning of the 


The younger children, as one would expect, 
unfamiliar word. The most childish attempts, by chil- 


fused the meaning of the artificial word with that 
of the whole sentence. Other children, a little older, could give a meaning 
that fitted the context of one sentence and later another that fitted the next 
sentence, Or, at a later age, they might give a meaning appropriate to two 
sentences but not a general meaning appropriate to all. Even at age ten, when 
children are using single words quite grammatically in their own speech, 
parts of the verbal context surrounding the artificial word crept into the 
meaning assigned to it. But by age eleven most children were able to make 


use of the context of all the sentences to derive a general meaning for the 


artificial word that was independent of any specific sentence. 
Learning from the context often lacks precision. Many objects that arc 
big, for example, are also heavy and strong. When a child hears one of these 


contextualizing” the 
dren around age eight, 
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adjectives in a sentence, the meaning he attaches to it may be the meaning 
that adults attach to one of the other adjectives. A differentiation of meaning 
comes slowly when the dimensions of meaning are correlated in the environ 
ment and discrepant instances are seldom encountered. Thus when opportuni- 
ties were created for children to make such confusions in describing simple 
objects (Ervin and Foster, 1960), strong was used where big would be correct 
with a frequency of 66 percent in the first grade and 21 percent in the 
sixth grade. In describing faces, pretty was used where clean was appropriate 
with a frequency of 91 percent in the first grade and 41 percent in the sixth 
grade. 

College students continually encounter new concepts, but a single en- 
counter is usually not enough for adequate learning. One of the carliest studies 
of context learning (Gibbons, 1940) presented such sentences as: 


In the beginning the teacher traveled from one locality to another to meet 
the students, thereby bringing into existence the 


[p. 29]. 


Only two-thirds of a sample of college freshmen were able to give the 
culturally accepted meaning for itinerant after reading 
other third gave such replies as a research schoolmaster. 
master, and a rural schoolmaster, 

The information for contextual le 


itinerant school master 


this sentence. The 
, an immoral school- 


arning comes from the other words and 
the sentence structure: but more than one sentence is necessary to strengthen 
the standard meaning and eliminate other meanings peculiar to a single 
sentence. In this respect, learning verbal concepts from a verbal context 
resembles learning figure concepts in the conventional laboratory experiment. 
A context learning experiment was arranged, therefore, with frequency of 
occurrence of each word in a sentence as the independent variable CWoolum 
and Johnson, 1969). Coherent paragraphs were constructed of sentences each 
containing 1 unfamiliar but genuine English 
saical, and amerce. The dependent variable was the score on a 5-item multi- 
ple-choice test of word meaning, hence the maximum score f 
would be 5. Under these conditions, 
faster than others, of course, but the 
achievement with an increase in fre 
exposure to a word the m 
2.3, 2 sentences 3.0, 4 se 


h word, such as posset, phari- 


or any concept 
some concepts 
ated a general increase in 
quency of occurrence in context. With no 
can score was 1.5, with ] sentence the mean was 
ntences 3.], 8 sentences 3.7, 12 sentences 3.9. 


college students acquire 
results indic 


Thus some of the subtleties of learning from 


: uld be quantita- 
tively investigated. It was possible, for exa 


constructing positive 


mple, by 
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and negative sentences of equivalent information, to verify the expectation, 
from previous work with figure concepts, that the subjects would acquire 
concepts more readily from the positive sentences. 

Other variables that are influential in the achievement of figure concepts, 
such as the subject’s set and the amount of irrelevant information, are no 
doubt influential in this situation also; but there are some other variables, 
such as the syntactic structure in which the new word appears, that are 
peculiar to verbal context. Prentice (1966) was able, in fact, to separate the 
syntactic aspects of the acquisition of word meaning from the semantic aspects 
by varying the training given to fourth-grade children. Semantic training 
was given by showing three pictures in association with a syllable, such as 
fud, referring to all three. The syntactic training consisted of presenting a 
syllable in a sentence that indicated little more than its grammatical function 
as a noun, adjective, verb, etc. So, when sim was to be an adjective, three 
sentences were presented, along the lines of: “This is a sim dog,” and “This 
is a sim house.” Later, the achievement of semantic meaning was measured 
by a test of word equivalents, and achievement of the syntactic function of 
cach syllable by a test of use in a sentence. As one might expect, those with 
semantic training made an average score of 12.9 on the semantic test and 
only 6.7 on the syntactic test; while those with syntactic training made 7.4 
on the semantic test and 10.1 on the syntactic test. It is especially interesting 
to note that those children who did not learn what sim referred to could still 
al place in a sentence. 
from the context are pertinent to the activity 
of a student trying to puzzle out the meaning of an unfamiliar term in a 
textbook but, like the experiments on learning figure concepts, they are less 
arning that children do for many years outside 
and college are accustomed to formulating a 
and testing it on later instances. 
s, which are less efficient but 


put it in its proper grammatic 
These experiments on learning 


pertinent to the incidental le 

school. Students in high school 

hypothesis about the meaning of a word 

Young children depend on associative processe: 

nevertheless adequate for le ning of words that they hear very 
SS a h ¢ 


frequently in many contexts. 


arning the mea 


SYNONYMS 
Tf we try to look up the me : : ) 
synonym rather than a definition. Teaching materials frequently include 


a introducing an unfamiliar concept, perhaps with an 
And the conventional multiple-choice 
than a comparison of words to see 


aning of a word in a dictionary, we often find a 
aning y 


familiar synonyms wher 
explanation of the variations in usage. 
vocabulary test is often nothing more 


which one is most similar in meaning to another. 
has ignored synonyms, but we may speculate that the 


tively to refer to a class of objects would complicate 
ed context. Or let us suppose, in a conventional con- 
cept learning experiment, that the labeling of a class of objects by either of two 
terms was confirmed. By analogy with disjunctive classification, we would 
expect that this disjunctive labeling would retard concept attainment. 


Concept research 
use of two terms alterna 
learning from oral or print 


> — 
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There are also conditions under which the availability of synonyms helps 

concept learning; at least, explanations of meaning often include one or two 
Tr 3 i 

synonyms. Such explanations seem to be based on the assumption that all 
5 5 ani ent i a at cach 
synonyms have a general meaning component in common and sy X 
has a special component. By this assumption, as synonyms are added to the 
context, the general component is strengthened while the special components 
cancel each other out. 


A COMPARATIVE EVALUATION 

A recent experiment treated several of these methods of communicating 
concepts together in one experiment for purposes of evaluation (Johnson 
and Stratton, 1966). The meanings of four words, moderately dificult for 
college freshmen, were to be learned: alacrity, altercation, chide, and opulent. 
One group was given definitions of the words, then told to rewrite the 
definitions in their own language. Another group read a simple story in 
which each word appeared twice, then used each to complete a sentence. A 
third group was given practice in classifying short descriptions of events as 
positive and negative instances of these four concepts, with knowledge of 
results. A fourth had practice in identifying synonyms for the: 
also with knowledge of results. And a fifth group had a mixed program, 
including a little of each method, requiring the same overall time, about 
20 minutes. Nine days later, all five training groups and a control group took 
a battery of four different tests constructed so as to correspond to the four 
different training methods, 

In their total score on all tests, all the groups improved as compared to the 
control group with no training; but the group w 
the most. In fact, all groups exceeded the control group on each of the four 
different tests. The four training methods were equally effective; only the 
mixed training surpassed the others. The question of transfer was phrased as 
follows: How much of what was learned by one method transferred to the 
tests corresponding to the other methods? And in this experiment, unlike 
most others, the answer was that transfer was 100 percent. For example, 
the group that had practice on definitions did as well on the classification 
test as they did on the definition test. Apparently these college students, 


who were accustomed to studying concepts, did not stop at learning specifics; 
they learned concepts. 


se four words, 


ith mixed training improved 


Achievement of Mathematical Concepts 


Number classes differ from figure and word classes in some 
the investigation of numerical achievements has consequently taken some- 
what different paths. Nevertheless, the number concepts af the preschool 
child in our culture can be described in the same terms as the others. Number 
classes, to begin with, are relational classes: what two dogs and two blocks 
have in common is not a directly perceptible feature but a numerical relation 


respects, and 
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between perceptible features. The child learns these abstract relational 
properties of perceived objects and the symbols, such as one and three, that 
refer to them. By the time he goes to school the average child, when shown 
sets of two or three objects, can give their number and, when asked for two 
or three objects, can select that many from a pile CRussell, 1956). The 
troublesome question of definition does not arise in this context. If a child 
can pass both of these tests, he has the concept three—by any reasonable 
definition. 

Since young children learn so much in diversified domestic interaction, 
recording what has been learned at different ages is easier than manipulating 
the conditions of learning. The simple number terms are probably learned 
at first just as other terms are. The child hears “one apple” as he hears “an 
apple” and “the apple,” often accompanied by pointing or handling. The 
same terms are applied in the same way to many other objects, associated 
with singular nouns. Probably the concept two is learned from the context 
as one and another one, along with both and a pair, in association with 
plural nouns. This should be rather easy in connection with dressing since 
the child has two feet and two shoes to put on them, and such words are 
used often in many different sentences in situations where the meaning is 
closely related to other cues in the sentence or accompanying behavior. ‘The 
abstractness of numbers is balanced by the circumstance that there are only 
a few small numbers and these are used with high frequency in situations 
where the association between term and referent is easily observed. Even 
such terms as none and all gone occur frequently in ordinary communication 
and in games, hence the association between these terms and their referents 
should be relatively easy to learn. The empty class may be very abstract but 
the distinction berncen a goodie, of any kind. and nothing is one of the 


salient facts of early life. 


Differences between one and none and between one and two as classes 


or sets are salient to the young child, but above two such differences are 
less obvious. In the short-run activities of the young child, the instrumental 
value of a set of four objects does not differ much from that of a set of three 
objects; likewise there is usually nothing in the context of communication 
to differentiate sentences that refer to four peanuts from sentences that refer 
to three peanuts. One study of children’s word usage in sini CHor; 
1929) recorded sharply decreasing frequencies for m n Pa ve ae 
3,678, two 1,252, three 626, four 443, and five 277. ge dats in the 
first grade showed a similar decrease (Rinsland, 1945). ifference be- 
tween one and many objects is probably perceived before the corresponding 
symbols are leamed; but the symbols for the largest numbers are learned by 


i E z ey are used to refer to perceived sets 
rote as a meaningless series long before they P ae 


of objects (Russell, 1956)- Herd f y hi 
The methods of investigating the number classes of young children may 


be illustrated by a study by Long and Welch (1941) that deliberately 
oe ee In their number discrimination test, one group of 
ten-marbles was displayed in a box and another group of five marbles was 
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displayed in another box. The child was told to choose the box with the 
most marbles and he would find a toy under it. If he failed, he was offered a 
choice between ten and four; if he succeeded, he was required to choose be- 
tween ten and six, and so on, up to ten and eight. In the number matching 
test, two marbles were placed in front of the child. He was handed a supply 
of marbles and told to make a pile himself, alongside these two, that was 
“Just as big.” If he succeeded, three were placed in front of him, to be 
matched. In the group matching test, the child w shown four groups of 
marbles—two groups of two each and two groups of three each—and asked 
to point first to the groups containing the most marbles and then to the 
groups containing the smallest number. Children of four years got a mean 
score of 5.6 on number discrimination; they could discriminate five or six 
from ten. The mean score on number matching was 3.5; 
2.6. By age six, these concepts had been quite well mastered up to numbers 


as large as seven. In general, the children who did well on one test did well 
on the other two tests, a correlation which 


in the conceptual achievement unde 
tests. 

Since “the new math” introduces sets at 
concepts by young children takes on especia 
designated by numbers, they are not 
classes of the relational type. Practice with sets is olten introduced as a kind 
of stimulus predifferentiation, however, to help the children discriminate 
the sets that numbers refer to, Suppes (1965) and others presented firsavade 
children with 2 sets, side by side, of simple designs feck we a star and a 
circle or 3 circles) and told them to push one button if the ecm: ete “the 
same” and another button if they were “not the same.” Won responses 
were corrected, according to the definition of the concept tes be le: ine l esor 
ing the identity of 2 ordered sets was very eas aloe die Saint nee for 
example, in the order AB and AB, about 95 percent of the responses in. the 
a block of 8 trials were “the same.” Sets that were not AA were more 
difficult: en unordered sets contained the same clements, as represented 
by AB ma BA, and “not the same” was reinforced, only about 70 percent of 
the responses on the first block of trials were correct. But both achievements 
came up to almost 100 percent in 56 trials. Perhaps concrete perception 


suffices for the former; some abstraction and : $ ; 
r > and manipulation 3 ortainly neces- 
sary for the latter. pula are certainly 


The concept of identity of sets, disreoz 
g 


arding the ¢ - was 
. è s rder of the elements, was 
somewhat more difficult, but the most interest -aa 


identity was easier than identity, When the set 
detection of nonidentity began in the first block 
but when they were such as AB and BA and the identity response was re- 
inforced, the identity response in the first block was less das 50 percent. 
Perhaps with these sets the nonidentity task requires less ahalbi than 
the identity task, but in any case achievement on both tasks approached 100 
percent in 56 trials. 


on group matching 


points to some degree of unity 
rlying performance on the three different 


a tender age, the learning of set 
l interest. When the sets are not 
in fact number concepts but figure 


Mg outcome was that non- 
S were such as AB and CD, 
of trials, at about 90 percent; 
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Equipollent sets are the hardest to learn. These are, in Suppes’s terminology, 
sets of the same number, such as ABC and BDE, regardless of the elements 
or their order. Nonequipollence, as between AB and CDA, was detected 
readily, but equipollence, without the help of identity of elements, came 
rather slowly. In general, set concepts differed markedly in their difficulty, 
depending on the definition of the set; furthermore, the amount of transfer 
from the learning of one set to the learning of another was surprisingly small. 
In fact, negative transfer was common, as in learning the identity of un- 
ordered sets after training on ordered sets. In general, the learning curves for 
such experiments fit all-or-none assumptions rather than incremental assump- 
tions CSuppes, 1966). 

There are only a few number concepts to be learned, but complex rules 
for grouping objects enter into mathematics as well as logic. In order to 
examine the strategies for achieving such rules, Dienes and Jeeves (1965) 
had schoolchildren and adults learn the systems of relations that mathema- 
ticians call two-groups and four-groups. These are genuine mathematical 
structures, but they have the advantage for experimental purposes that they 
are unfamiliar to most people, and adults as well as children have to start 
from the beginning. A two-group, for example, has two elements, such as 
cards A and B. The experimenter displays A or B in the window of an 
exposure device, then the subject plays A or B and guesses which will appear 
next in the window. The rules for the structure of one problem are: A 
followed by A yields A; AB —> B; BA => B; BB > A. 

Observation of the subjects’ moves and comments indicated that some 
tried to memorize all the combinations of events; others treated all the cards 
on the same level and tried to find patterns in them. The third strategy was 
to regard the card played as operating on the card in the window, as having 
the power to alter it. This was the most effective and was most often em- 
ployed by the adults. Even though these mathematical structures seem rather 
difficult, presenting them in the complex-to-simple order, that is, from the 
four-group to the two-group, yielded more explicit statements of the rules 
than the simple-to-complex order. The latter order allows the subject to get 
started just by memorizing instances, but evidently delays the search for the 
underlying structure. AET 

Learning names for sets of different numbers is more than a discrimination 
task and also more than a one-to-one naming task because the numbers 
are arranged in an order. Children learn to say their numbers in ig by 
rote, as they learn to chant their ABCs: “One, two, buckle your shoe; three, 
four, shut ‘the door. . . .” This is easy serial association. Enumeration or 
counting, which requires coordinating the verbal behavior perp tel 
grouping, is more difficult; the abstract numerical properties ons j s 
become more definite through the activity of counting j as 8 e heey 
become more definite as words are applied to them. But a B a oe 
ment of small numbers takes place in so many contexts thi iffcult to 


ied i tory- 
stu i n be studied in the laboratory 
dy, counting ca p requires the perceptual segregation 
Counting a limited number of objects req 
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of the counted set from the uncounted set, so Beckwith and Restle (1966) 
raised the question whether the principles of perceptual grouping that 
Gestalt psychologists have emphasized might have any function in this 
counting process. A line of 18 symbols printed on a page, for example, has 
continuity; hence if one counts from left to right, the set that has been 
counted is easily grouped and separated from the set to be counted. A circle 
has continuity but no ends; hence separation of the counted from the un- 
counted might be more difficult. Reading skills are involved, of course, but 
the counting task goes beyond mere reading. 


Experiments with school children and college students disclosed that small 
printed figures were more easily counted when arranged in rectangles. ‘The 
linear arrangement was also easy. A circle was quite difficult, and counting 
time was longest when the figures were scrambled. Similarities of color and 
shape were also important. The facility of counting the rectangular arrange- 
ments was related to the choice of numbers with fairly equal factors: 12, 15, 
16, 18. The subjects, especially the college students, took short-cuts when 
possible, counting subgroups based on color, shape, or size of a rectangle. 
then multiplying. 

The progression from the perceptual to the abstract is nowhere more im- 
pressive than in mathematical achievement. The first few natural numbers 
are learned concretely by preschool children, but achievement then moves 
from these small numbers to larger numbers that have less concrete signifi- 
cance and from number concepts to numerical operations. Children in school 
learn higher-order classes of numbers—such as even numbers, perfect squares, 
and prime numbers—but mostly they learn the relations between numbers 
and numerical operations like addition and long division. It is this abstract- 
ness of number classes and numerical operations that has led many teachers, 
from time to time, to introduce concrete construction materials to facilitate 
mastery, arranging them, or letting the child arrange them, in order to make 
the abstract properties more apparent (Bruner and Kenney, 1965). Such 
tricks are useful in the acquisition of figure and word classes also, but not as 
much as in the acquisition of numerical relations. The hardest task for school- 


children is arithmetic problem solving, to which we shall return in later 
chapters. 


Other Cognitive Structures 


Viewed structurally, a class concept is rather simple: certain things are 
: ‘ i a gs 
included, others excluded. A dimensional concept is 


w Pai a also a simple structure. 
A taxonomy for classifying birds is a more comple 


: s X cognitive structure, as is 
the structure of a university or a state government. But students do learn the 


more complex structures in one way or another, as is indicated by an improve- 


ment in the ability to assign an object to its proper place in the structure, 
to describe the structure, to communicate with others in terms of this struc- 


ture, and to solve problems involving it. 
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The achievement of a complex structure of knowledge, in the arts and 
sciences, is a difficult task that requires study; it is much easier to arrange 
objects, especially people, in a simple linear order or hierarchy, or in simple 
groups, as suggested by G stalt principles of organization. The argument for 


treating a simple structure as a “good figure” comes from the preference 


shown in ordering objects and from errors made in learning social structures 
of different degrees of complexity (Henley, Horsfall, and De Soto, 1969). 
The more dificult, complex social structures can be learned, of course, but, 
like disjunctive figure concepts, the learning goes slower. 

Extrapolating from the research on figure classes, we can assume that 
achievement of classes of people will be influenced by salient features: the 
effects of skin color, hair styles, and clothes on the formation of stereotypes 
are obvious examples of this principle. We can assume too that the achieve- 
ment of classes will be influenced by labeling, and the effect of labels for 
occupational groups on stereotyping illustrates this tendency. The weak 
influence of negative information as opposed to positive information also 
applies to stereotypes. 

The self i concept of special interest to each individual and to the 
psychology of personality. Each individual’s concept of himself is a singular 
concept, as I and me are singular terms that refer to it, but the self is multi- 
dimensional. Each individual can assign himself to classes, such as nationality, 
sex, and religion, and can locate himself on many dimensions, such as height, 
, aspiration level, and acceptance by others. Thus the 


age, economic status 
concept of self defines the individual's position, as he conceives it, in various 


overlapping social structures s 
In addition to the cognitive structures so far mentioned, which for the 
and labeled, each individual’s knowledge 
also includes cognitive structures that are implicit and unlabeled. Some of 
b . . 
these may not be represented by words but rather by spatial images or 
diagrams. Such idiosyncratic structures are not very useful in communica- 
tion, but some (e.g., implicit theories of personality) may be critical to the 
private life and thought of the individual. 
Explanatory concepts and principles are cognitive structures with a specific 
role in problem solving; they will be given due consideration as such in a 


later chapter. 


most part are explicitly defined 


Major Trends and Implications 


N 
he concept ranks as one of amsn d witt : 
functional unit of abstract thought associated with a conventional concrete 


symbol that extends man’s powers of classification, communication, and 
problem solving far beyond those of other animals. In accordance with its 
several functions, a concept may be acquired in several ways. The develop- 
ment, during the past few decades, of objective methods for empirical attack 
on the concept constituted, therefore, a victory for experimental psychology. 


most effective inventions, a multi- 
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It was a minor victory, however, because the methods were adapted from the 
study of discrimination learning in nonliterate animals and were limited to 
the classification Function. In justification of this restricted methodology the 
concept, as teachers, philosophers, and other scholars understood it, was re 
defined as a common response to diverse objects. Thus, the early experiments 
consisted merely of learning to make one response to positive instances of a 
class and another response to other objects. 

The questions asked by educational, social, and developmental psychology 
about the class and dimensional concepts current in the culture were not 
answered by this limited approach; hence more varied approaches and more 
realistic definitions have recently appeared. Many observers have noted that 
children often learn the word first, then the class or dimension to w hich it 

refers. Children, and adults also, frequently acquire a concept from the con- 
text of a sentence, from an explanation, or from a dictionary definition. The 
diverse functions of a concept in intellectual activity suggest diverse ways of 
acquisition and diverse measures of achievement. Whatever unity a standard 
concept has for an individual, as measured by his performance on different 
tests of concept achievement, depends on the consistency of his experiences 
and ultimately on the consistency of usage within the culture. The concept 
is a part of the culture as well as an achievement of the individual, and con- 
cept achievement an intellectual aspect of the socialization of the individual. 
A large share of recent concept research has focused on the learning of 
figure classes synthesized from familiar dimensions—more because of the con- 
venience of the materials and the hope of generalizing the findings than 
because of the urgency of this particular problem. In spite of variations in 
details, some major trends are clear: unidimensional classes are casier to learn 
than multidimensional ones, and among the latter, conjunctive cl 
easier than relational and disjunctive. Positive instances are more informative 


sses are 


than negative for learning conjunctive classes, but negative instances are more 
informative for disjunctive classes. Learning is influenced by the number of 
irrelevant dimensions that are varied as well as the number of relevant dimen- 
sions, but it goes faster when the relevant dimension is salient. Although such 
effects can be stated one at a time, in complex situations interactions are com- 
mon. 

Stimulus-response associationism, accepting the common-response definition 
of a concept, can theoretically account for concept learning quite simply be 
cause the common feature of positive instances of a class is always associated 
with the response that is reinforced, while the irrelevant features are reinforced 
only randomly. The limitations of this theory in complex situations led to 
mediated associationism, with hypothetical verbal mediators between stimuli 
and responses. The reversal experiment was introduced to validate the media- 
tion hypothesis, but interpretations have not been uniform. The results of 
certain kinds of verbal training support the hypothesis of verbal mediation. 
put these results could also come from learning to attend or from perceptual 
differentiation. 

The principal alternative to the associationist view of concept learning sees 
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the thinker in a more active, exploratory role, selecting promising cues ¢ 
testing hy potheses. Whereas the associationist view predicts a cot, Saran on 
crease in frequency of correct responses, the hypothesis-testing view aea 
an abrupt improvement when the correct attributes are identified Su ee i 
evidence of all-or-none learning has been reported in some sim Je i et 
multidimensional concepts allow more opportunity for Be A m EN 
, When complex classes are to be acquired, it is fruitful to n identi 
fication of the relevant attributes from identification of the rule of combinatior 
of the attributes. The initial difficulty in grasping complex classes and sales is 
reduced considerably by appropriate instructions and training. Suggestions re 
materials and for training procedures have come from abolie koi ete 
concept learning of this type resembles logical reasoning more than discrimina 
tion learning. 
The model builder is the romantic of psychology. dreaming of an elegant 
solution to the messy problems of psychological research. But no single model 
of concept achievement has consistently generated more substantial dreams, 
when exposed to the light of continuing empirical investigation, than the 
others. Determining the conditions under which associative learning and 
contribution of each to the overall perform- 


hypothesis testing occur and the 
rguing for one 


ance is a more sophisticated theoretical enterprise than a 
against the other. The evidence already available suggests that when the sub- 
ject does not know what he is doing, associative learning is a good frst ap- 
proximation. When he is well informed about the task and the materials to be 
conceptualized or when he has had appropriate pretraining, the hypothesis- 
for the observed activities. But even when 
except under 
d in the as- 


testing model accounts better 
testing hypotheses the subject does not select cues at random 
rather restricted conditions; variations in the salience of cues an 
sociative strength between the correct cue and others have to be included in 
the model. In this theoretical context, pluralism is realism. 

Experiments on the learning of word classes are especially informative 
because the effects of word associations can be examined. Several of the 
learning of figure classes also influence the learn- 
ing of word classes in about the same way. In addition there is convincing 
evidence that certain strong prior associations help and others impede the 
formation of word classes; under some conditions these effects can be predicted 
in advance. Knowledge of the structure of word associations is clarifying some 
aspects of cognitive organization just as cognitive organization, in exchange, 
is clarifying some aspects of verbal learning. ; 

Abstract concepts are acquired through relationships with previously learned 
concepts. It is now necessary for concept research and psycholinguistics to 
pool their techniques in order to describe how a reader, or listener, acquires 
the meaning of a word from many sentence contexts. College students, 
especially, are accustomed to studying concepts communicated by a variety of 
methods in books and lectures, and they can transfer their achievements to 


other situations quite flexibly. 
Research on elementary nu™ 


variables that influence the 


ber concepts is not vulnerable to some of the 
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criticisms that have been directed toward research on figure concepts. There 
is no uncertainty about definition, because when a child discriminates and 
matches groups of objects according to an abstract numerical property, such 
as threeness, in spite of differences in other properties, and uses the con- 
ventional numerical symbol to refer to groups that have this numerical 
property, he is demonstrating achievement of the concept by any reasonable 
definition. And there is no argument about the applicability of the results 
because the concepts of three, four, five, and so on are not concepts synthesized 
to simulate something else; they are the standard concepts that children learn. 
In spite of their abstractness, they are acquired incidentally in ordinary 
domestic conversation, like the other basic concepts, probably because there 
are so few of them to learn. All intellectual deve 
cours 


lopment is cumulative, of 
; but this generalization is most obvious for mathematical 


achievement, 
because elementary number concepts are prerequisite for arithmetic manip- 
ulations and arithmetic manipulations in turn are prerequisite for the ab- 
stractions of higher mathematics. 

Man’s perceptual operations are adapted for 
environment, rather than events in the sense 


ception of objects. Analogously, 


perceiving objects in the 
organs that mediate the per 
man’s intellectual operations are directed 
toward the concepts that have become significant to him, rather than the 
particulars through which they were acquired. 
in the particulars of the acquisition processes; 


he psychologist is interested 
the thinker forgets the partic- 
ulars but retains the concepts as instrume 


nts for communication and problem 
solving in the future. 


CHAPTER 3 


COGNITION 
AND CONCEPTS 
CONTINUED 


Individual Differences 
Cognitive Development 
Language and Thought 

Major Trends and Implications 


F psychology were only a search for general 
laws, the experimental evidence assembled 
in the preceding chapter would serve as an 
j introduction to our topic. But there is evi- 
dence in all experiments of individual variations in 
performance. These may be quantitative variations in the 
parameters of the general laws: some subjects learn con- 
cepts faster than others; some attain a higher terminal 
level of achievement than others. Experimental psychol- 
ogists would be happy if all these individual differences 
could be explained as differences in parameters of general 
laws, but this is an oversimplified hope. The present 
aust consider individual differences along lines 


chapter n 
that have not entered into concept experiments, as well as 


those that have. The largest individual differences depend 
of course; hence we shall compare children of 


on age, 
various ages and try to trace the course of cognitive de- 
velopment. Since this is intertwined with language 


ast section will be devoted to questions 


development, the | 
thought. 


about language and 


Individual Differences 
The most interesting differences between people are 
those that remain stable over a period of time; and al- 
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though most concept experiments are not directed toward this, evidence of 
stability is occasionally reported. Some consistency in concept attainment 
strategies from problem to problem has been found (Eifermann, 1965). 
There is also some consistency across problem materials, for those who 
worked a figure concept problem conceptually, rather than nonconceptually, 
did the same on a verbal concept problem (Segel and Johnson, 1969). But 
many of the interesting variations between subjects’ activities that show up in 
a concept experiment are transitory, specific to the situation, and unlikely 
to show up again. 


VARIATIONS IN RESPONSE TO OBJECT PROPERTIES 

People do not approach a concept problem without bias. A property of the 
stimulus objects that impresses one person as an obvious basis for conceptuali- 
zation may not impress another person at all. In the case of children, responses 
to object properties shift markedly with age. 


PREFERENCES FOR DIMENSIONS 

When young children are allowed to sort multidimensional objects into a 
few groups or categories in their own way and when they choose alternative 
attributes to match a model, they usually exhibit a preference for matching 
by color. After age five or so they prefer to match by form, One comparison 
of form and color, for example, found a slight preference for form (45 form 
choices out of 72) at age five, increasing to a strong preference (69 out of 
72) at age nine (Harris, Schaller, and Mitler, 1970). The salience of the 
dimensions can be experimentally manipulated, of course: increasing bright- 
ness differences increases the frequency of matching by color rather than 
form (Corah and Gross, 1967). f ii 

The next question is whether these preferences influence performance on a 
concept attainment task. To answer this, the subjects must first indicate their 
preferences by sorting in their own way, and then learn to sort according to 
the dimension specified by the experimenter. When college students learned to 
categorize pairs of dots, their initial preferences for dimensions had no effect 
(Imai and Garner, 1965); but the dimensions in this experiment were the 
position and orientation of the dots, dimensions that do not seem to be as 
compelling as form and color are to children. The performance of children in 
nursery school and kindergarten on a simple concept task was facilitated 


when the critical dimension corresponded to their preferences for form or 
color (Suchman and Trabasso, 1966). 


There are methodological difficulties with this type of experiment, however- 
A later experiment gave a preference test to children in kindergarten, first. 
and third grades, followed by the Wisconsin Card Sorting “Test, which in- 
cludes color, form, and number, as the concept task (Mitler and Harris, 
1969). The expected results were obtained, most subjects preferring the form 
dimension and then learning the form concept rapidly. But when other chil- 
dren were given the two tasks in the reverse order, quite different results were 
obtained. We see again that children are impressionable, like adults: both 
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their dimension preferences and their concept achievement were influenced 
by the immediately preceding activity. So the question of preferences and 
concept achievement has not yet been completely answered. 

If we turn now to verbal materials and ask how children classify words 
when they are free to choose their own categories, the most interesting re- 
sult is an increase in the use of classes based on grammatical attributes. In 
one developmental study, for example, adjectives were grouped with other 
adjectives 20 percent of the time in the second grade, 30 percent in the 
fourth grade, and 41 percent in the eighth grade (Mandler and Stephens, 


1967). 


REPRESENTATION OF OBJECTS 

The use of more lifelike objects in concept experiments brings up the 
question of mode of representation. When line drawings representing trees 
and other familiar objects, varying in form and color, were presented to 
children for matching to a sample, 95 out of 120 preferred to match on the 
basis of meaningful representation (Bearison and Sigel, 1968). At this age, 
from seven to eleven, meaning dominated form and color. In fact, earlier 
research (Sigel, 1953) had indicated that middle-class boys of this age would 


categorize by meaning whether they were categorizing objects, pictures, or 


words. This is not the case for all children, however. Lower-class children of 
l objects (such as a cup, a 


five and six categorized actual three-dimensiona 
pencil, and a pipe) just as the middle-class children did, but they had more 


difficulty in categorizing photographs of these objects—even though the — 
graphs were realistic, lifesize, and in color (Sigel, 1968). Research of this 
kind is beginning to specify the environmental conditions that produce con- 


ceptual deficiencies in disadvantaged children. 

CA later section will take up the question o 
ingful cognitive structures and symbols for repr 
of his environment, since this pursuit of mean 
of the story of cognitive development.) 


f how the child acquires mean- 
esenting the objects and events 
ing is one of the main themes 


CATEGORICAL STYLES è r 7 7 7 , 
When children and adults sort meaningful objects in their own way, by 


free classification, variations in preferences for different gps a a 
can be distinguished. Kagan, Moss, and Sigel (1963) were able to describe 


three such categorical styles as they appeared in adults groupings of 
pictures of people. The analytic style tends to abstract an sire objective 
similarity shared by two or more figures, for example, “People with no shoes 
on.” The inferential style involves an inference about the patterns grouped 
together, for example, “People who help others. And the relational style pre- 
fers to group objects on the basis of a functional relation between them, such 
as “A married couple.” Similar studies of children showed a fair individual con- 
sistency of cognitive style over one-year intervals, although the preference for 
the analytic approach did increase somewhat. ; 

Whether these are the best names for such Sorting tendencies may be 
argued, but cognitive styles do make a difference in concept achievement 
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CLee, Kagan, and Rabson, 1963). Third-grade boys characterized by the 
analytic approach learned analytic concepts more readily than others, and 
the nonanalytic boys learned relational concepts more readily. Since the 
category style tests and concept learning tests were given several weeks apart, 
the criticism noted above (that the first test may influence performance on 
the second) probably does not apply. 


COGNITIVE ABILITIES 

It has long been known that the achievement of concepts is related to age 
and intelligence, but to clarify these relationships requires the standardization 
of materials and methods. Osler and Fivel (1961) had 180 children, half in 
the average range of intelligence and half in the superior range, learn to 
choose one of a pair of pictures to match another of the same class Ce.g., birds) 
in order to get a marble. Fig. 3.1 displays an increase with age and a consist- 
ent difference due to intelligence. By age fourteen, all the ‘subjects of high 
intelligence were able to reach the criterion of ten consecutive correct choices. 

Dividing the children, according to the records of their responses, into 
sudden and gradual learners indicated that the sudden learners were mostly 
in the groups of superior intelligence. Presumably the superior children 
understood the nature of the task and thus used a hypothesis-testing approach 
which caused discontinuous progress, while the other children learned by a 


gradual strengthening of associations. However, a similar study with children 
of eight and nine classifying dot figures found discontinuous acquisition for 
those below average IQ as well as those above (Hamilton and Saltz, 1969). 
These are simple classes for which discontinuous acquisition would be ex- 


pected according to the general trend of the results outlined in the preceding 
chapter. 


30 High 


N 
o 
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Number reaching criterion 


FIGURE Rh, Concept learning in rela- 
ey to age and intelligence. Six groups of 
30 children each, Data from Osler ana 
Fivel (1961). 
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Recalling the importance of the instructional set in earlier work with 
adult subjects, Osler and Weiss (1962) carried out another study, like that 
of Osler and Fivel, but this time they gave half the children the explicit 
instructions that “There is an idea that will te!l you which picture will give 
you the marble [p. 530].” These special instructions were generally helpful, 
but most helpful to the subjects who needed them most, the average children 
aged six and ten. 

Undoubtedly, general intelligence is correlated in a general way with 
abstract intellectual accomplishments of the past and present. To obtain more 
detailed information about the abilities required for success in concept 
acquisition under controlled conditions, tests of narrower abilities have been 
are often put together from 


constructed. Tests of general reasoning abilit 
arithmetic reasoning problems and deductive reasoning problems, such as 
evaluating the validity of a syllogism. Tests of induction, or inductive reason- 
ing, may present such items as oak, maple, pine, and birch: the subject is 
instructed to find out what three of these words have in common and then 
Hence these are often called oddity problems. Or 
instructions. Thus 


underline the odd one. 
pictures of the four objects may be presented with the same 
an ability test and a concept experiment might call for corresponding con- 
ceptual processes, but there are differences in the methodology that limit 
the correspondence. 

The purpose of the test is to get useful scores representing individual 
differences in ability during a short testing period. The methodology of test 
: ards, such as reliability, and 
andards. The concept 
rement 


construction sets up certain psychometric stand 
the items are selected and revised to meet these st 
experiment, on the other hand, usually tests a hypothesis about improv 
in performance involving a comparison between groups of subjects, and the 
materials and procedures are selected to fit this purpose. If the two assignments 
evoke the same intellectual process from the subjects, this process may easily 
be obscured by the methodological variations. The fact that similar names are 
given to ability tests and to experiments does not guarantee similarity of the 
psychological processes. f 

The pattern of abilities that determines which people get high scores and 
which get low scores on a variety of different tests is often described by a 
statistical procedure known as factor analysis. Individual differences in per- 
formance on 15 tests are probably not due to 15 different abilities; the goal of 
factor analysis is a simpler description of the abilities involved. After a large 
number of subjects have taken the tests, all the scores are correlated: if the 
correlation coefficients are all high, it is reasonable to assume that there is a 
common factor, such as general intelligence, underlying performance on all 
tests. But often the pattern is more complicated. A group of tests, A, B, and C, 
may be highly correlated, another group, D, E, and F, highly correlated, and 
a third group G, H, and I, also highly correlated; yet the correlations between 
groups are low. Inspection might reveal that one group is composed of tests 
of verbal content, another of numerical content, and another of geometrical 
figures, hence three group factors are identified. By such procedures—partly 
Statistical and partly inferential—factor analyses of scores on batteries of 
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diverse tests have attempted to identify the principal factors responsible for 
individual differences in performance. The same procedures have also been 
used in analyzing the abilities to be discussed in later chapters. 

In a serious attempt to analyze conceptual abilities in detail, Dunham, 
Guilford, and Hoepfner (1969) persuaded 177 juniors and seniors in high 
school to spend three days taking a battery of tests and working a variety of 
concept problems. Factor analysis of the scores disclosed |] abilities, some 
described by content (such as figures, letters, and words) and some described 
by operations Csuch as cognition, memory, divergent production, and con- 
vergent production ).! The concept learning scores were correlated with the 
ability factors at several stages of learning, but these correlations were low: 
scores for success in verbalization of the concepts also had low correlations 
with the ability factors. Tasks requiring classes of figures 
be transformed, at least by some subjects, 
is correlated with verbal abilities at some 
agreement comes from another factor analy 
CLemke, Klausmeier, and Harris, 1967), which found factors of general 
reasoning, induction, and verbal comprehension in concept learning. 

Since instructions about concepts influence performance, they might also 
influence the pattern of abilities required. Dunham and Bunderson (1969) 
therefore gave a battery of standard ability tests and concept learning tasks 
to high-school students with and without instructions in decision rules for 
discovering the concepts. The well-known induction factor contributed to 
concept achievement under botl 
rule instructions were given, 


and of letters can 
into verbal tasks, so that success 
times but not at others. Some 
s of similar performance data 


h conditions, as one might expect. But when 
the general reasoning factor also contributed, 
and when rule instructions were not given, associative memory was involved. 
Such findings are consistent with the contributions of associative learning 
and reasoning to performance i 
chapter. 


The general drift of these r 


n the concept experiments of the preceding 


esults is that if anyone wishes to predict which 
students will achieve concepts most readily, a test of general intelligence 
gives a good indication. If more precise prediction is desired, tests of induc- 
tive thinking, general reasoning, and verbal comprehension are worth trying. 
But in order to adapt teaching procedures to individual differences by varying, 
the instructions and materials, similar data from the specific situation in which 
the procedures are to be employed will be needed. 


Cognitive Development 


The story of intellectual growth is an intricate 
characters, plots and subplots, which we shall start to take up now and 


will resume again later. This chapter will provide a synopsis and will 
emphasize the role of the concept, certainly a major one. 


one, with major and minor 


A i i iv oning is inc! 
1Īn this terminology, inductive reasoning is included 


8 gnition and deductive 
reasoning u r convergent production. 0; 
i nde: gent d nder c 
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As children grow, their intellectual performance expands in all directions. 
Older children have more experience, more interest in intellectual things, 
more understanding of what is expected of them, better memory, and better 
control of their attention. Higher scores on intelligence tests are an index of 
the general quantitative improvement. This development is influenced by 
biological maturation, similar in all cultures; but beyond this, development 
proceeds in the direction of the particular adult culture in which the children 
mature. With a few exceptions, development is cumulative: elementary 
schoolchildren build on their preschool concepts as high-school students build 
higher on their elementary-school achievements. 

Some of our knowledge of cognitive development comes from research that 
makes these generalizations more precise, by comparing the achievements of 
children of different ages and plotting curves of their progress. The more 
exciting and more difficult research attempts to uncover the nature and 
causes of such development, the qualitative changes as well as the quantita- 
tive. All theories of development predict that older children do better than 
young children, hence graphs and tables that show age changes do not sup- 
port any one theory. More specific descriptions of achievement and more 
specific hypotheses of their origins are necessary. 

Methods for studying cognitive development could begin with the methods 
of concept learning research, as described in the preceding chapter. A few 
illustrations were given there to show how experimental psychologists have 
modified their experiments to make them suitable for children. By combining 
experimental and developmental methods, Kendler and Kendler (1962), as 
we noted in the preceding chapter, have described a transition during the 
early school years from single-unit association to mediated association in con- 
cept learning, More recently, after analyzing the data of a concept shift 
study including college students, they have suggested that with increasing age 
an increasing proportion of their subjects were operating in the hypothesis- 
testing mode (Kendler and Kendler, 1970). Experimental child psychology 
isa growing trend, but for the most part research styles have not been copied 


from experimental psychology. Observations and tests of children aimed at 
i have been more common than 


ariables. Susan Isaacs (1930), 
’s intellectual activities, espe- 
d then interpreted her data 


attaining measures of status at different ages 
experimental manipulations of independent v 
for example, carefully observed young children 
cially their Why questions, in natural settings an ( 
: of cumulative experience and maturation. 

e classifying or sorting tests, not only to measure 
make qualitative inferences about the child’s 


to show the interplay 
Child psychologists often us 


Progress in learning, but to ý 
conceptual activities. A common procedure is to spread out before a child 


a number of blocks or other objects, with instructions to “put the ones to- 
gether that belong together.” The Vygotsky Test, which has been described ii 
English by Hanfmann and Kasanin (1937) and can be found in many 
clinics, represents another procedure. In this test, 22 blocks of various sizes, 
shapes, and colors are displayed to the subject on a table. The examiner states 
that there are four kinds of blocks, the names being concealed on the bottom, 
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He turns one block over, showing the subject its name. The subject is in- 
structed to find other blocks like this one, put them at one corner of the table, 
and put the other three kinds at the other three corners. The subject may 
begin to group the blocks on the basis of color, but then sees that there ie 
five colors. He may begin with shape, but there are more than four shapes. 
The examiner can correct the subject, if his grouping is wrong, by turning 
over two or more blocks in a group and showing that they have different 
names. Overall scores may be obtained from the time required and the number 
of corrections given, but Vygotsky (1962) and his collaborators in Russia 
studied the test performance qualitatively in order to make inferences about 
conceptual and linguistic processes in children and adolescents. 

Many observers have reported that on such sorting tasks young children 
group objects in collections—Vygotsky called them complexes—rather than 
classes and by concrete perceptual similarities rather than by abstract proper- 
ties. The concrete syncretic language of the child has also been contrasted 
with the more flexible symbolic language of the adult. We have noted how 
children differ at different ages 
Thus, while some 


in their initial responses to perceived objects. 
psychologists have been following the quantitative route 
toward reliable measures of intelligence and smooth curves of intellectual 
progress, others have been following the qualitative route toward describing 
cognitive structures and processes at different stages of intellectual develop- 
ment. Much of the early qualitative work was summarized and published in 
English some years ago by Heinz Werner (1940). 

In recent 


years the study of cognitive development has been dominated by 
the influence of Jean Piaget, of the Unive 
reactions to it have brought new life 
a whiff of fresh air into the 


rsity of Geneva. His work and the 
to research on cognitive development and 
concept learning laboratory. 

PIAGET’S STAGES AND STRUCTURES 
Piaget's grand design is to ch 


art the growth of the child from a sensori- 
motor animal, beginning life with some reflexes and some sensory capacities, 
to the abstract thinker of fourteen or so, who uses arbitrary symbols to refer 
to abstract classes, carries on formal logical operations, and detaches himself, 
so to speak, from his sensorimotor limitations to spend his life with the cogni- 
tive structures of the adult culture. Returning to the meanings of thought 
given in our introductory chapter, it is not the introspective analysis of thought 
as mental experience that interests Piaget, nor the objective ‘description of 
problem-solving behavior, but the cognitive schemas, the structures of knowl- 
edge, which guide the observable behavior, 

In the first year or two the child, according to the Geneva school, has only 
sensorimotor schemas—one could hardly say knowledge—of things. He knows 
a plaything only in terms of what he does with it, how it mee when he 
grasps it, how he transfers it from one hand to another, how it looks and 
tastes, how it drops when he lets go. Adults are likely to separate perception 
from action, but the child’s schemas are not differentiated at this stage. One 
of the first actions is looking. 
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The assumption of coordination between sensory and motor systems in 
perception of the environment is not peculiar to Piaget, of course. It is the 
standard doctrine, although the language differs, among modern students of 
perception. In a discussion of “active touch,” for example, Gibson (1962) has 
described in subtle detail how motor and sensory systems cooperate to collect 


information about objects in the environment. 

But around age two the child develops relatively permanent schemas for 
objects, so that he can identify a toy whether he sees the front or the side of 
it, whether he is biting it or throwing it, and even after it has rolled out of 
sight. His actions and thoughts are controlled by the schema even though 
he is no longer actively handling or looking at the object. Freed from depen- 
dence on his actions and the feedback from his actions, his knowledge of 
the object moves from the motoric and the perceptual toward the cognitive. 
The child may acquire a name for the object and thus we might say, in 
the language of an earlier chapter, that he has acquired a singular concept. 
But Piaget calls this stage preconceptual because the child does not use 
concepts flexibly, as he will later. He has grasped the permanance of objects; 
he perceives relations and can choose the larger of two pieces of candy. But 
these accomplishments are still more perceptual than conceptual. 

From age two on, his intellectual operations become more elaborate, and 
around age seven flexible, reversible, cognitive structures and operations can 
be assumed. A reversible structure, to the Geneva school, is one that permits 
the elements to change in form while overall equivalence is preserved. Like a 
class concept, it permits the child to grasp the abstract similarities nie o 
superficial perceptual differences. In general terms, it means knowing t s 
if AB, then BA. If A+B=C, it is possible in thought to reverse the act o! 
addition to get C—B=A. i 

The celebrated demonstration of “conservation” involves the reversibility of 
a physical change. A lump of Plasticine or modeling clay is shown to a child, 
then it is squeezed flat, and the child is asked if it contains the same amount 
of material as before. At age five or six the child will probably say No. In- 
fluenced by one dimension or the other, the preoperational child will say it 
has more or less than before. But if it is molded back into its original shape, 
so that the child can perceive the equivalence, he will agree that it is A 
same as when first seen. A parallel test for conservation of liquid TE ma 
a glass of water, which is poured into a tall, thin glass, and into a T ae 
glass, each time with questions about amount. After age seven or so, the chi d 
has a cognitive structure that enables his thought processes to follow the 
physical processes and conserve the integrity of the material in spite of changes 
in form. He has a schema, albeit rather concrete, of conservation of matter 
that overrides the perceptual instabilities. When he sees the clay flattened, he 
can reverse this change in thought and thus answer questions about amount 
correctly. asi 

After seven, the child performs many indirect thoughtful operations, ac- 
cording to this account, at least on objects that he perceives concretely, The 
concrete-operational child can put things into classes and combine classes. He 
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can reverse some of the simpler relations, like taller than and father of. [ce 
knows that if A is the father of B, then B is the child of A. The five-year-old 
knows his right hand and his left hand, but not until seven or eight will he 
be able to point to another child's right and left hands. One way of putting 
this is that at this time, the latter part of the stage of concrete operations, he 
can do most of what adults do with complex problems of combination and 
physical space, but only with particular materials. 

Around age twelve the third major intellectual development begins, the 
development of formal abstract thought. Given a rich cultural environment, a 
child attains mastery of many logical operations within a few years and thus 
completes the liberation of his thought processes from their dependence on 
concrete particulars. 


The Geneva school does not insist on the ages at which these stages occur, 
but the sequence of stages is central to the theory. The child does not reach 
the stage of formal logical operations unless he first goes through that of 
concrete operations. Piaget was first a biologist, and the concept of maturation 
is prominent in his speculations. But this is not tissue growth or maturation 
in the neurophysiological sense: it is the cumulative, andes assimilation of 
the normal experiences of life, which organizes schemas that liberate the mind 
from control by any specific experience. 

The role of experience in intellectual development is not that of rewarded 
Practice, at least not in the conventional sense of learning due to external 
reinforcement. Rather, the child acquires a new cognitive structure when he 
perceives some conflict in his experiences and forms a new equilibrium from 
them. It is when a child experiences the discrepancies in his perception of the 
ee a different times Mat he tries to devise an atemporal cognitive 

a. à is Cognitive conflict. The learning is cognitive and the 
neh ieee ee Caer wore iad dam poe 
i dase at vie eit, beef is apsis should not sound rad r 
peresom, Woa n nown discussion of the reinforcement ; 

; : a — similar reasoning, and 
pe ~ OSists Write often of cognitive conflict and balance in 
cine are hs ae ene hey the nino 
theories when applied to these subtle intell Sele _ ai 

ectual achievements, 


FIGURE CLASSES AND DIMENSIONS 
be aan Sere: work of the Geneva school on concepts is that of 
= e a los he on g pssification and seriation, which describes 
ow children classify objects an : # i 
y ow they arrange them in ordered series. 


a sucuk concept achievement) is not 
an explicit part of the Genevan program. Inhelder and Pisci are woncemed 
rather with children’s classification of Perceived objects ad their replies to 
questions about the classification; and they are willie, PAD ai the 
child’s communicative skill is adequate to the task—if fe einen we 
perienced and astute. 


COGNITIVE DEVELOPMENT * 10] 


Piaget establishes rather strict logical criteria for a class because of his 
interest in genetic epistemology. The class is characterized by extension: the 
set of instances that belong to the class. It is also characterized by intension: 
the properties of the members of the class. A child may be able to discriminate 
dogs as instances of one class from cats as instances of another without being 
able to describe the properties of either. He may be able to talk about dogs 
in the conventional way, but he does not really have the class dog until he 
has the superordinate class animal and knows about other animals that are 
not dogs, as well as the similarities and differences between these classes. In 
general terms, Piaget says that the child does not use classes correctly in 
j il extension and intension are coordinated, so that 


Cognitive operations until e J l 
consistent with the general and specific 


the objects included in the class are 
Properties of the intensional definition. 

It is somewhat different for an ordered series. While the child cannot per- 
rceive some relations: similarities; symmetries; 
and farther. But an ordered series is more than 
and other transformations within the series, 
Ives, but on the abstract dimension 


Ceive classes per se, he can pe 
and inequalities, such as larger 
Perception; it implies transitivity 
based, not on the perceived objects themse 1 
common to all of them. A child may perceive that A is larger than B and B is 
is larger than C, but he does not make the logical inference that A is larger 
than C until he has achieved the transitivity schema. This is the reason, ac- 
cording to Piaget, why even though a child can aoa ge ee 
3 longer than another, = ability to place three or four blocks in order o 
en j sly. 

oe ae certain preconceptual attainments AS necessary. 
Maturation is not enough; language plays an umportant role but a at suffi- 
cient by itself. The chief supports are the sensorimotor schemas, ua t üp by 
Perception and action. When an object is presented wa sas he identifies 
it as if by comparing its visual, tactual, and instrumenta pora with 
whatever schemas he has developed. These diverse properties will later become 
the intension of the class. (Thus Osgood’s ee CONSEUEE: page 
66, and Piaget’s schema are similar in that pag 45 meen an 
object depend, at least in part, 0? espousa to the 4 i eia i S eneva 
school the schema constitutes a basic pect rence meee hon Fie fol 
it comes directly from perception but because i percep 


back ‘ ipulation. ; 
me Teye SELA of the objects displayed and his answers to ques- 
e child's arrangem d Piaget in relation to these idealized 


tions are i reted by Inhelder an 

interprete y A 

Cognitive wy eee which mark the stages of progress toward adult logic. 
: 


The method of free or unconstrained dystion is used, with instructions 
to put together the ones that are alike—thoug! Peg i are some- 
times iatied in order to bring out critical — a sae The 
objects to be classified may be beads, geometric forms, pictures of plants, and 


: d other familiar attributes. 
the like ino in color, size, shape, 27 : 
Pe har a and five, lacking true classes that are independent 
n el a : 
of ihe locati a he elements, often place the objects where they form 
Jon 


Spatial patterns or little aggregates of similar objects that seem to satisfy them, 
= Di 
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One child, for example, put the objects in a straight line, each object having 
some attribute in common with the next, though the common attributes 
shifted along the line. This is what Vygotsky (1962) called a chain pais 
The child at this stage seems to change his intensional a and his 
`, i a cs a 7 f- a Pi 1 a oe 
basis of grouping as he looks from one part of the collection to anothe — 
At later ages, five to eight, spatial Juxtaposition is less common as a a 
for grouping. Heterogeneous residues, which at younger ages are often left 
over from inadequate partial collections with shifting criteria of similarity, 
are no longer seen. Then comes an important step, building on what has 
gone before, when all the objects are 
fication—all the squares in one pile 
Inhelder and Piaget insist, however, 


arranged by a single criterion of cla 
, all the circles in another, and so on. 
that the development of classes is not 
complete until these are differentiated into subdivisions so that the child can 
operate by inclusion and exclusic 
class does not exist in isolation. 
One study by Inhelder and Piage 
their methods of observation 
ials consisted of 20 cards, 4 of which represented colored objects, the other 
16 representing flowers, Eight of the flowers were primroses, 4+ yellow and 
the other 4 other colors, The child was expected to answer such questions 
as: “If you make a bouquet of all the yellow primroses, will it be larger or 
smaller than a bouquet of all the 
will there be any primroses left?’ 
answered correctly, 


n to obtain a hierarchical classification. A 


t on the addition of classes will illustrate 
and the inferences made from them. The mater- 


primroses?” “If vou cut all the flowers, 
’ By age ten such questions are most often 
For example, when the question was, “Are there more 
primroses or more yellow primroses in the bouquet?” the proportion of correct 
replies increased from 30 percent for children of five and six to 96 percent 
for children nine and ten, Above age eight questions such as, “If one cuts all 
the flowers, will any primroses remain?” 
“Anyone who takes all the lowe 
is at last adjusted to intension, 
e must remember that Piaget 
cerned with the use of concepts in 
more concerned with the use of con 
class A, that might be primroses, 


are usually answered correctly: 
rs takes all the primroses too.” Thus extension 


i irectly con- 
and his colleagues are not directly cor 
i . ne sy are 
interpersonal communication. They ar 

‘ mah $ m 
cepts in the child’s own reasoning. If ¢ 


is contained in the superordinate class B, 
that might be flowers, logical reasoning calls attention to the complementary 


class A’ consisting of the Bs that are not As, in this case the flowers that 
are not primroses. In order to get credit for conceptual thinking from the 
Geneva school, the child must know, in a formal abstract way, that the 
class B includes the class A and also all the A’, and thus that the number 
of elements in the class B is necessarily larger than the number in the class 
A, regardless of the number of each in view, The addition of classes can be 


ce, ally : B= r hi 5 : z rowth, 
expressed formally as: B A+A - A child might, at one stage in his growt 
be able to use the terms primrose and flowe 


unable to perform these logical operations o; 
ments in each class. 


: j : i 
r in ordinary conversation but : 

ye Je 
T answer questions about the e 
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NUMBER CONCEPTS 
Number classes, like others, are based on a common abstract property. All 


classes of two things, designated by the term two, have the property of twoness 
in common. But, while classes of flowers are arranged in a hierarchical system 
or taxonomy, classes of numbers are arranged in a series, characterized by even 
more abstract properties, such as asymmetry and transitivity, which permit 
very abstract and very efficient operations. The small number classes and their 
names, such as one, two, and three, are acquired by young children in ordinary 
domestic communication in connection with pointing, picking up, and asking 
for more, but they remain tied to the perception of concrete materials. The 


larger numbers are acquired later, in connection with counting and other 


serializing operations. 
y zi 3 ë aT a ue 
One of Piaget's methods (Flavell, 1963) for studying the development 
from perception of numerical correspondence to more abstract operations was 
of beads and ask him to take the same number from 
The method was similar to the number matching 
method that Long and Welch employed in 1941 (see page 839 inst whereas 
they stressed standardized testing and quantitative treatment, | iaget stressed 
litai corvations, Piaget reported that the young child is likely to 
qualitative observations. Piaget rep i ifl beni f 
r zarie ‘|, with more beads or fewer 
make a row of about the same length as the mode er, 
than abstract control. Later the child 
is likely to make use of a one-to-one correspondence, placing d bead ‘a 
the first one of the model, another bead opposite the EXE Jans sn) Spe ae 
spondence by spreading the beads of the model 
$ oa may have to add more beads to make the 
farther apart, the child at this stage may vo merical properties be pre- 
two leneths look the same. Only later will the nur al prop I 
pr Sn i i i As Flavell put it, Once more a 
ptual dislocations. /\$ ; 
able operational existence through a cobweb 


to give the child a row 
a pile of beads nearby. 


thus demonstrating perceptual rather 
the examiner destroys the corre 


Served in spite of perce 
concept must fight its way into st 


X aT 313].” 

of sion-pr sing perceptions [p. 312 = AG. ai 
illusion producing k bk a the correspondence between series of dissimilar 
o look 4 


the child was shown ten dolls of differing 
ks of differing heights and told to ar- 
asily find the stick that belongs to it. 
fr the voungest children because it 


The next step is t : 
objects. In one of Piaget's studies, i 
heights and ten miniature walking stic 
range the sets so that each doll D 
This seriation problem was impossible 


srties of ? 
depends on such abstract eet all poate cae ee sa 
ai at the s 
for example, the cognition that 


the firs he same time shorter than the third. The older child who 
he first and at the id seriate the other, however, and establish the cor- 
d seria j 
$ e although some children lost the correspondence 
ee ee : ent was ‘disturbed by the examiner. The Beckwith 
nm s x 
'g 86) remind us that even college students make 
ible to speed up counting. 


the series as reversibility. It requires, 


could seriate one set cc 


when the perceptual a 

and Restle experiments Cpage z 

use of perceptual patterns when poss 
URES 

VE STRUCT ; 

EE development via the experimental psy- 

© often have difficulty in understanding just 


CONCEPTS AND COG 
Students who approac a 
chology of concept attainment 
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what Piaget's structures are. Are they the same as the concepts of the preced 
ing chapter? 

The concepts of experimental psychology and the cognitive structures of 
the Geneva school are both ability constructs. People who have acquired them 
can do things that others cannot do. Both are general achievements, not 
specific responses to specific stimuli. And both are abstractions, inferred from 
what people do and say. But a structure may be more or less than a concept. 
To Piaget, classes and dimensions are just two kinds of structures. Larger 
cognitive structures include systems of classes, even in the minimal case of 
flowers, primroses, and flowers that are not primroses. The structures under- 
lying the achievements of conservation and transitivity are more than simple 
classes and might be called principles or explanatory concepts. In fact, Piaget's 
questions, aimed at description of the structure, often ask for explanations of 
superficially puzzling events. 

If we define a concept to include the svmbol that refers to the objects or 
relations between them, the concept may be more than the structure because, 
to Piaget, the structure is used by the child in his own thinking and does 
not necessarily have a label for use in communication. Verbal methods as well 
as behavior methods are necessary for examining the child's thought struc- 
tures, but a structure may be operational even though it is not verbalized. 
That is why, we are told, the examiner must be informed and skillful. 


RESPONSES TO PIAGET 


While research continues at Geneva, Genevan ideas have traveled far. 
Reactions, as to all new ideas, have varied CIsaacs, 1930; Kessen and Kuhlman, 
1962; Lovell, 1966). Regardless of the evidence, many psychologists have an 
a priori distaste for the cognitive structures, the flexible methods, and the 
speculative style of the Geneva school. Piaget's insulation from the predom- 
inant trends in English-language psychology gave him the freedom to elaborate 
an original set of ideas and gave others the freedom to ignore them, as he 
ignored the ideas of others, To many psychologists, however, the impact of 
Piaget was to make the child’s intellect 
social and sexual development. If the 
fluence agree on anything, 
again. 

The most fundamental criticism of Piaget and the Geneva school is directed 
against their methods for inferring the existence of a cognitive structure. The 
demonstrations that they report assume that these structures can be directly 
diagnosed through the child's behavior and 
tive examiner familiar with the responses 
Others, such as Braine (1962), argue that th 
ical construct that can be validated only by 
test situation—including test materials and jinstructions—and to the resulting 
verbal and motor behavior. When other investigators get similar results with 
similar methods, the empirical findings are con fried, but the possibility of 
an alternative interpretation is not eliminated. An inadequate reply to a 


ectual development as exciting as his 
many evaluative essays on Piaget's in- 
it is that child psychology will not be the same 


answers to questions by a sensi- 
of younger and older children. 
e cognitive structure is a theoret- 
specifying how it is tied to the 
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question about classification of figures may be due to a lack of verbal skill or 
the inability to attend to the critical dimension, as well as a lack of a certain 
logical structure, Smedslund (1969) has listed specific examples to illustrate 
how an apparently inadequate response may be due to failure to understand 
the instructions or the information given or simply to forgetting them, rather 
than to lack of cognitive structure. Gelman (1970) has devised a very sensitive 
number test for children of 3 and + which gives positive results for conserva- 
tion of the numbers 2 and 3 in spite of spatial rearrangements, even though 
other children of the same age could not pass the conventional test of con- 
servation of these numbers. In the same way an apparently adequate response 
may be due to guessing, direct perception, or accidental success rather than 
to achievement of the structure. 

Piaget, like Freud, will not be proved right or wrong. There seems to be 
little dispute about the major theme of development, that it proceeds from 
the concrete perceptual to the abstract, but questions have been asked about 
many of the minor variations on this theme. Some are too vague to test, but 


aaf . si cere m ‘ ated boa : 
others have inspired interesting research that can be illustrated by a few 


examples. 


i :NTAL STAGES 
Gra o ‘question raised by the apparently casual Geneva methods 
; Will other examiners working with other children get 
h > In general, the answer is a qualified Yes. Somewhat similar 
the same results: In s = 1930s by Vygotsky (1962) in a book on the 
stages were described n a children that has only recently become 

4 > O ssid A : 
thought and language Werner (1940) working with German 


; i oe. GG 
available in English, and also by) es w 
children. The vok of Smedslund (1961a) on conservation of weight and 


itati ea is type. 
gilbstanaar | lhrierates well-controlled, quantitative a arch . ig a = 
ma Same e ercent, 1 in he 
found ‘chat two midges agreed very well, 95 to ps f “ae E 5 
child 's | a s and that Oslo children were roughly similar in 
rens exp anations ¢ 


i F nsistency between 
kinds of ex ions to Geneva children. He found less co yt 
t = nana li e of development than that reported 
ests and a somewhat d 


by Piaget, however. coated research giving general support to 
A p con of sophisticatec S he: devel 
; nother illustration O splication of scalogram analysis to the develop- 
: ap ; 
Piaget is Wohlwill’s (1960) Tiie a different language, Wohlwill began 
sl « = ä 
ment of number concepts- : velopment of a concept has its origin in an 
with the premise that the develoy proceeding thence by a 


i «on of abstraction, 
essentially discriminative funct T mediating structures to an eventual state 
) ation O aa 
process of gradual elaboration 
t-a 


i z sets as a purely representational symbolic entity [p. 
in which the concept exists 4 en have been used primarily in the field 
ti S 


» tec A k 
345].” Although ale ey can be applied to a set of tasks presumed to 
of attitude measurement, th ‘ysis of the patterns of success and failure by 
develop in sequence. An ana’y*» hether these tasks constitute 


ä veal w. 

j i ild then reve A 

children of different ages paet be concluded that mastery of a given task 
a scalable set, If they do, it ™® 


all tasks below it in the hierarchy of difficulty, 
presupposes the mastery ° 


is an empirical one: 


ferent sequence 


(o) 
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which is equivalent to the assertion of a sequential order of development of 
the functions tapped by these tests. 

Wohlwill employed a carefully planned matching-from-sample method: 
choice cards containing two, three, and four dots were hung in view in the 
apparatus, then a sample card containing perhaps three figures was placed 
in front of the child to match to one of the choice cards, a chip being the 
reward. Some preliminary practice assured understanding of the task. The 
tasks were selected to test the child's ability to transfer the response to 
problems in which the correspondence between sample and choice cards 
requires symbolic mediation in a variety of forms. The tasks, in theoretical 
order of difficulty, follow. 


A. Abstraction. In this task, presumed to be the easiest, the sample cards 
differed from the choice cards not only in number but also in form and 
color. Thus the child had to abstract the dimension of number from among 
other, irrelevant dimensions. 


B. Elimination of perceptual cues, The sample cards for this test con- 
tained rectangles drawn in outline and divided into two, three, or four equal, 
adjacent Squares, separated by thin lines. It was assumed that perceptual cues 
were thus eliminated and correct response would depend on the intervention 
of a symbolic process, such as counting, 


C. Memory. The sample cards were those of the training series, but the 
choice cards were removed so the child had to make his choice by the posi- 
tion of the choice cards as he remembered them. 


D. Extension. This test exposed choice cards of six, seven, and cight dots, 
and corresponding sample cards were in view. Since numbers of dots higher 
than five are not immediately discriminated, it was assumed that mediation 
of some symbolic activity would be necessary. 


E. Conservation of number. The choice cards contained six, seven, and 
eight dots, but small buttons were used in pl 
tons were first arranged in a configuration which matched that of the 
correct choice card; then the buttons were scrambled and the child had 
to match to the correct choice card without counting. This test differs from 
the previous ones in demanding a concept, not of a specific number, but 
of number in general, and more Particularly of a functional property of this 
concept: its independence of configurational aspects of the stimulus aggregate. 
As noted above, Piaget postulates that the conservation of meei in- 
dependent of perceptual variations, is a central step in conceptualization. 


ace of sample cards. The but- 


F. Addition and subtraction, This te 
button was either added to the collecti 
from it. Here the test seems to assume 


st was like E above except that one 
on in view of the child or subtracted 
conservation of number and to go one 
step beyond it. 

G. Ordinal-cardinal correspondence. The choice cards held two, three, 
and four dots; the sample cards showed eight solid bars, one of which was 
red, arranged in order of increasing length; and the ordinal position of the 
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TABLE 3.1. NUMBER oF CHILDREN (ouT OF 72) 
PASSING EACH OF SEVEN NUMBER TESTS 


B. Elimination of perceptual cues 49 
A. Abstraction 46 
©. Memory Se 
F. Addition and subtraction 24 
D. Extension ra 
E. Conservation . 
G.  Ordinal-cardinal correspondence 6 


From Wohlwill (1960). 


red bar was to be matched with the number of dots on a choice card. That 
is, if the red bar was third in length, the card with three dots was the cor- 
rect choice. This correspondence between ordinal number and cardinal num- 
ber is a rather abstract attainment which is presumed to depend on the 
attainments measured by the previous tests. 

n to 72 children, age four to seven, the order 
of difficulty, shown in Table 3.1, was approximately as expected—although 
i ared to D and E—and the overall performance 
age. More important, the scalogram 


When these tests were give 


Was surprisingly casy comp 
showed the expected correlation with 
analysis indicated, in Wohlwill’s words, 
z in the response patterns of the individual Ss, 
esupposed, in general, success on 


a very substantial degree of orde: ; 

ala £ be i 
ie that the mastery of ony given, irem m inten Fp st ge 
all easier items. Such a finding Jus = à 
conceptual complexity underlying these problems and suggests the LD 374]. 
of a consistent developmental process in the conceptualization p. 374]. 
i av 7 ess support to the sequence 
A series of studies by Dodwell gave somewhat less 

es of studies by Se amd by Piaget. Dodwell (1960) 


of develop f ber concept 
ment of number à ; 
confirmed Piaget's contention that young children oy fully a 
aget’s ; gh 
the concept f ber, even though they may be a ew gun ut the 
ee eg were quiet different. Dod- 


S : adii ildren, 
sequences he found, with Canadian ea + be fitted to a quasiscale (1961). 
Well’s results, unlike Wohlwill’s, could only 


A later study (1962) compared number sich nee a the ae of 
` SUY im ; assumed to be at the same 
classes but found that correlations between tests as 


level were quite low. 


: jaget’ ge comes from res 
Another limitation on the generality of Piaget's stag esearch 
f § 


i $ epts of nationality and geo- 

by J $ , children’s concep y g 
y Jahoda (1964) on Glasgow Diner 4 E 

raphy. Te adopted the questions used by sig ene ie eo work 

a i Ji 7. 7 u A 

on classes, asking the children whether they were " i As E he they were 

British ahal ae they could be both at once, an m hether they lived in 

4 y ey à arities 2 seca i 

lasgow or in Scotland Certain developmental nog sei ae in the 

atiswers but not the developmental stages of the Geneva school. The Geneva 

stages, v ith classes of beads and flowers, may not be applicable 
ges, worked out with class 


ionality. 

to these F septs of nationality , l 

Me e — ee oat of conservation are influenced by small varia- 
easures of achieve 
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tions in method, as well as content, more so than one would expect of stable 
cognitive structures. Consider the father who is a doctor; is he still a father? 
An affirmative answer would be evidence of conservation of the concept of 
father in spite of additional information about a particular father's occupa- 
tion. Sigel, Saltz, and Roskind (1967) asked three such questions of children 
aged five to eight, but introduced some variations. When the children were 
told that the father was a doctor, 69 percent gave replies indicating conserva- 
tion; but when other children were told that the father studied and became 
a doctor, the proportion dropped to 39 percent. When representative cutout 
pictures were presented instead of verbal questions alone, the proportion of 
correct responses dropped from 71 percent to 42 
the pictures contained irrelevant, distracting info: 
The consistency 


percent, presumably because 
rmation. 


that showed up in this experiment should also be men- 
tioned. The children who had three conservation tests, de: 


i aling with father, 
mother, and sister, under the same conditions, were very consistent, Most of 
those who exhibited conservat: ) 


i 'on on one test exhibited it on all three; most 
of those who failed one failed all. Such results 
across variations in content, though the v 


as large as from beads and flowers to n 
The examination of children 's 
by the cross-sectional method, but Alm 
the longitudinal method also by at six th intervals, begin 
P s . S AE: a pane ; 
ning in kindergarten, Although t j an i pe 
hiey : gh the two methods told the same story 
achievement of conservation of n 


umber and quantity »rcentages 
a uantity for overall perce fz 

t j ge : ] A n 
at any one time, the longitudinal data disclosed fluctuations in the progress 


of individual children during this transitional period. About half the children 
moved ahead steadily, but many who performed well on an early examina 
tion regressed on a later one. It is difficult to evaluate the magnitude of these 
fluctuations psychometrically; but they are probably not entirely due to um 
reliability of measurement because correlations between .50 and .60 with 
measures of intelligence set a lower limit to the reliability of the conservation 
measures. At present when a child gives evidence of conservation, we cannot 
be sure that he will do the same at another time by another method of examina 
tion with quite different content. : 
Despite continuing questions about methods, the cumulated evidence has 
established that Piaget's stages are more than epistemological speculations: 
they can be understood and investigated by other psychologists, with in- 
teresting empirical results. The final certificate of scientific respectability was 
attained when computer programs for simulation of these stages were aie 
(Gyr, Brown, and Cafagna, 1967). Ambiguities of method were bypasse 
by arranging induction problems with lights and switches, hence the aP 
paratus resembled the problem-solving apparatus of the next chapter 4? 
the problems resembled the rule-learnine problems of the preceding chapter 
Since the circuits that controlled the lights were not visible, of course, © 
subjects were free to think of the Geo in terms of their own cognitive 
structures. Programs written to simulate preoperational, concrete-operationa » 


are evidence of consistency 
‘ariations from father to sister are not 
ationalities, 

thinking has proceeded, with few exceptions 
y (1966) exploited the advantages ° 
examinations 


the 
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and formal-operational thought turned out different sequences of responses, 
and when students in grade school and college worked the problems, their 
response sequences had some relation to the Piaget stages. For example, 65 
percent of the children in the fourth grade Cages nine and ten) exhibited be- 
havior like the model for concrete-operational thought, as expected at this 
age, while 22 percent exhibited behavior like the model for preoperational 
thought, and 13 percent like formal-operational thought. As in the concept 
experiments of the preceding chapter, detailed instructions raised the level of 
performance. 

It is fair to conclude, from these few samples of recent studies, that stand- 
ardized procedures and careful control of the test situation do not destroy the 
phenomena to be studied. Reliable multiple-choice tests of conservation of 
quantity and weight, additive composition of numbers and classes, and other 
constructed to yield scores that correlate closely with 


structures have been 
more flexible interview procedures 


the corresponding scores obtained by 
(Freyberg, 1966). For 151 schoolchildren ages five to seven, the scores cor- 
532 and with spelling .418. Furthermore, these 


t two years later: arithmetic computation .524, 
571. Such results, which are typical of results 
es with smaller samples of children, show that 
ng tests and achievement on standard 
educational tests of similar content have much in common. In fact, items 
testing achievement of conservation, transitivity, and the like could be in- 
cluded in a test of general intelligence where their correlations with total 
scores would be similar to the comparable correlations for vocabulary and 
arithmetic items. 

Thus, carefully controlled research clearly does not destroy the phenomena 
to be studied: on the contrary, it yields more precise description of the stages 
of development and illuminates inconsistencies due to variations in content 
and method that might otherwise escape our notice. Apparently, if a child 
has only a vague, unstable structure, some test questions and some materials 
may bring it to operational status, at least temporarily, and others may 
obstruct it. There is considerable evidence that all manner of task variations 
can facilitate or impede the child’s ability to give en cd 
responses during the transitional period between six an rae ; Serf em 
Hill, 1969). Hence differences in the criteria of achievement and in Hexibility 


of probing for evidence can produce wide variations in results. oo 
materials and procedures for individual testing have now been stam ardized 
so that several psychometrically adequate measures of conservation can be 


obtained (Goldschmid and Bentler, 1968). 


related with intelligence 
Scores predicted achievemen 
arithmetic problem solving 
from other correlational studi 
achievement on Piaget's trend-settir 


DEVELOPMENTAL PROCESSES 
Piaget's assertions about the acquisition © 
a new equilibrium are not explicit, but assertions about stages of development 
implications. To say that the child does not perform 


has progressed gradually through several pre- 


of a new cognitive structure by 


do have explicit negative 
logical operations until he 
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operational stages is to challenge the cficacy of education and the pon 
of educators. It is this aspect of Piaget's writings that has stimulated critica 
discussion and research by educational psychologists. No one objects when 
Piaget states that the child constructs reality by experience with it nor Š hen 
he implies that varied experience with concrete objects is good preparation 
for the achievement of abstractions. But those with faith in learning _ 
against what they interpret as assertions that learning must wait on interna 
autonomous changes. 
Piaget’s equilibration theory assumes that conservation results, not from 
external reinforcement, but, internally, from an organization of the perceptual 
feedback from addition and subtraction. Adding means more, subtracting 
means less. If no adding or subtracting is perceived, no change is perceived: 
hence conservation has been achieved. Smedslund (1961b) reversed this 
logic and tried to extinguish conservation of weight by tricking his subjects. 
On the basis of tests of conservation of weight, children Gye to seven were 
divided into a group that had achieved conserv 
be given practice in this performance. The practice consisted of weighing 
balls on a balance before and after deformation until they were able to give 
correct explanations. Smedslund’s test pro 
Plasticine on a balance showing that 
molded one into a sausage, inconspicuously removing a small piece so that 
one weighed more than the other, and asked for an explanation, Six of 
the 13 who had attained conservation in the norm 
resisted extinction, but none of the 
with external reinforcement did so. 
Wohlwill and Lowe C1962) tried to teach kindergarten children conserva- 
tion of number but did 
effective. Four conditions of training were tried out but none of the groups 
improved more than the control group which had only the pretests and 
posttests. Apparently, merely giving the pretests and posttests aroused fie 
cognitive conflict and equilibration necessary for some subjects to achieve 
conservation. , 
Limited success in training for conservation was obtained by Beilin 
(1965) with an apparatus similar to that of Wohlwill and Lowe for testing 
for conservation of number. There were three parallel columns of corks, 
as in Fig. 3.2A, and the child was asked to choose the row that was like the 
middle row. The apparatus permitted contraction of the middle column so 
Ptually while the number was conserved, as 
in Fig. 3.2B. Then the child was asked again to choose the column that 
was like the middle column. Training Was given by several methods, includ- 
ing Smedlund’s equilibration method, ] 
was a verbal explanation: 


ation normally and a group to 


cedure was to place two balls of 
they weighed the same. Then he 


al course of experience 
. raini y 
11 who had received the recent training 


not find that ordinary reinforced practice wa 


that the length changed perce 


affective 
but the only one that was effectiv 


Now I am moving them. See, they are standing in a different place, but a 
are just as many dots as before. They only look different. See, I can put the 


4 , > Ë " ar as 
back just the way they were, so you see, there are still the same number 
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FIGURE 3.2. Arrangement of corks used by Beilin o o o 
(1965) to study conservation of number. A B 


before because 1 did not add any dots or take away any dots. I only moved 
them [p. 326]. i 


This verbal training was effective in increasing conservation of number and 
length, on which the training was given; but there was no transfer to con- 
servation of area, on which no training was given. Beilin suggested that the 
essential feature of the verbal-rule instruction was its algorithm feature: 
“It provides S with a model or rule for processing relevant input data [p. 
3371.” 

In Beilin’s experiment, those childre 
tion at all and those who showed almost complete conservation did not 
improve much, The training was most effective for those children who were 
action between learn- 


n who on pretest showed no conserva- 


in a transition state, a result that emphasizes the inter 
Ing and maturation. 

One of the most thorough expe 
by deliberate training was carried \ 
kindergarten children, using several standardiz 
nds of training. Reversibility training consisted of repeated demon- 
be returned 


riments on the acceleration of conservation 
out by Goldschmid (1968) with 110 


ed measures of conservation 


and two 
strations that after an object's shape has been changed, it may 
fo its original form; compensation training showed the child that when one 
dimension was changed (when the ball of clay was flattened), there was a 
compensatory change (the ball became longer). The training was done in- 
dividually for three and a half hours over a three-week period. The ex- 
that were trained on three tasks registered definite 
improvement on three other tasks which were similar, and the improvement 
less similar conservation tasks. Reversibility train- 
e than compensation training. On a follow-up test two 
years later, the experimental groups were still superior to the control group. 

One of the minor mechanisms in the Piaget machinery of thought is cen- 
tration, the tendency of the young child to center attention on one salient 
attribute of an object. In a test of conservation of liquid, for example, the 
attention of the preoperational child may be captured by the height of the 
liquid in the glass to the neglect of other attributes, while the older child 
is able to decenter and check the other attributes. Ie might notice a decrease 
in width compensating for the increase in height, for instance. 

These little tasks, that look so simple to an adult or an adolescent, who 


riment e 5 
perimental groups 


also transferred to other, 
ing was more effectiv 
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knows what to look for, are perceptually quite complicated to children. From 
the young child’s point of view, it appears that he is asked to make a judg 
ment of multidimensional stimulus objects without knowing which dimen 
sions are critical. Evidence of this complexity comes from an experiment 
carried out by Kenney (Bruner, Olver, and Greenfield, 1966). This showed 
children pairs of glass containers of water and asked which of each pair was 
fuller and, later, which was emptier. In these simple comparisons there were 
many perceptual attributes that the child could observe, such as height of 
the glass, water level, water volume, and whether the water reached the top 
or not; some of these visible attributes supported the correct judement while 
others were misleading. When all the attributes supported the correct judg- 
ments, nearly all the judgments were correct, even at age five, Less perceptual 
support yielded fewer correct judgments and, when all the perceptual cues 
were misleading, nearly all the Judgments were in error, until age eleven. 

The inability of the young child to decenter can be overcome by ap- 
propriate training, like so many other inabilities. The appropriate training 
in this case consists of practice on a variety of problems, called oddity 
problems, with distracting stimuli (Gelman, 1969). Children of five who 
had failed conservation tests learned not to attend to the irrelevant features 
of oddity problems and then became successful on conservation tests. The 
gain transferred to other conservation tests and was still in evidence two 
or three weeks later, 

Does this mean that an improvement on conservation tests should be 
described as an improvement of attention? Not necessarily, If children achieve 
conservation through one of the other kinds of training described above and 
a test of attention is administered later as a dependent variable, it would 
probably appear that directing attention to the relevant attributes is an 


of these to assessment of quantity. As 
ment to the adoption of a quantity cri- 
hen cues such as height are found to be irrelevant, 
but when height and breadth jointly are found to be consistent with quantity 
[p. 302].” Most conservation tasks do not require a precise knowledge of 
height and breadth but they do require knowledge of the effects of change 
in the system. Knowledge of the system means, foe example, the knowledge 
that if height is increased and breadth remains constant, quantity increases; 
if height is increased, a compensating decrease in breadth would conserve 
quantity. Halford suggests that children learning to use quantity indicators 
alone would not be learning the quantity concept but rather a sophisticated 
rule which enables them to do without it. 

The samples of training studies reviewed here, and many others (Elkind 
and Flavell, 1969), suggest some general conclusions. The developmental 


Halford (1970), put it, “The impedi 


terion is removed, not w 
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stages described by Piaget are susceptible to change by special training under 

avorable condition, to be sure, is that of the 
a higher stage by himself. 
Apparently the carly training studies were not as carefully carried out as 
the later studies, but it is evident now that definite gains can follow from 
well-planned programs. There is agreement also (Brainerd and Allen, 1971; 
Goldschmid, 1968) that reversibility training is the most effective. One 
ations facilitate the achievement of 
the multidimensional system and thus are a significant part of the processes 
by which transition occurs under natural conditions. 

In general terms, this transition must depend on an interaction between 
the internal maturational forces and the external effects of experience in a 
specific environment; but apparently the optimal environmental conditions 
et been adequately determined. It is a hopeful 
sign that experimental psychologists are turning their attention to cognitive 
development and beginning the unspectacular task of manipulating materials, 
instructions, and conditions (as in the preceding chapter) in order to 
s of genuine significance. 


certain conditions. The most f 
child in a transition stage, about to move into 


could speculate that reversibility oper 


are complex and have not y 


examine cognitive achievement 
we n a 1 

THE DEVELOPMENT OF REPRESENTA TION 
ng of the development of logical structures, 
Bruner and his associates (Bruner, Olver, and Greenfield, 1966) under- 
took to map a parallel and sometimes coincidental, development, which they 
call the development of representation. How does the child represen! the 
world? How does this representation change woh age and experience? How 
is it influenced by the development of languages a ; 

The earliest form af representation, corresponding to Piaget’s sensorimotor 
sti : ‘od rathe an thought,” Bruner te 
Stage, jn which things are “lived rather tha ; ae terms 
enactive, The young child knows where an ie = any cure he per- 

aj . - se R ut also because t ach- 
Ceives it against a stable perceptual background f ble b k he reach 
ing movements that attain it are launched from a stable ackground of 
balanced neuromuscular tensions: One might ah if o sa r; talk of 
representation at this age, that the cognitive tas% $ foe ; bs rE A 

è s age, oe ‘ y 
of this internal neuromuscular representation and replace it by a more flex 


i x ssentation. 
ible, perceptual, action-free representatior : 
n, which 


second stage in representato’ 


mpressed by Piaget's mapp! 


Bruner terms iconic, begins to 
emerge in the second year when the child ge e em] to himself by 
images or i hemes that are relatively independent oF action. Iconic 
Se s or spatial sc hemes sre he ke images are projected fiom the: child's 
resentation is egocentric in tha 3 E 
own point of sieve anid colored by his likes and erry By adult standards, 
iconic representation is superficial. The young s 4 Seater to make 
Sense out of a complex display may grasp at concrete etails depending on the 
sali <j; and fluctuations of attention, or at obvious global 
alience of the stimuli an rs Gg more likely to grasp th 
characteristics, whereas the older child 3s y grasp the full 
; 
schematic structure. 
An image, like a picture, 


may be worth a thousand words, but it is the 
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“unpicturable” categories associated with symbols that represent that world 
most effectively. This symbolic form of representation is built on enactive 
and iconic foundations by cultural Cespecially linguistic) processes. “Thus 
Bruner’s tracing of the path to symbolic thinking touches on more of the 
linguistic precursors than Piaget’s. 

The linguistic accomplishments of a child of four—the ability to under- 
stand and produce sentences according to complex rules, and the accompany- 
ing motor skills—have aroused much wonder and, recently, sophisticated 
research. But an even more subtle accomplishment, as Bruner points out, is 
the child’s ability a little later to adjust his thinking to the syntactical 
requirements of the language and to use the sentence 


to express his thoughts. 
To differentiate words from objects and to use 


words instrumentally to 
refer to objects, even to objects not present or to the 


is to gain a powerful semantic leverage. 

To examine the development of iconic representation and the transition 
to symbolic representation, Bruner and Kenny (Bruner, Olver, and Green- 
field, 1966) gave children a double classificat 
of nine glasses displayed in Fig. 3.3. The glasses varied by three values in 
height and three in diameter, To acquaint the child with the matrix, the 
experimenter removed a few glasses and asked the child to replace them. Af- 
ter the replacement task came a reproduction task: the glasses were scrambled 
and the child was asked to build “something like what was there before. 
Then came a transposition task: the glasses were scrambled again but the short 
thin glass in the southwest corner was now 
and the child was asked to build some 
leaving the one glass where it h 

The replacement task was e 
three; but the reproduction tas 
of the matrix and the guid 
Few children below fiy 
virtually every 


ir abstract properties, 


ation problem, with the matrix 


placed in the southeast corner 
thing like what was there before, 
ad just been placed. 

asy, except for the youngest children of age 
k seemed to require the retention of an image 
ance of behavior by this image (see Fig. 3-4). 
e succeeded. It seemed to be a copying task because 
successful child reproduced the matrix in its original orienta- 
tion with no transposing of dimensions; in fact, the children said they were 
trying to remember where the glasses had been before. Transposition was 
much harder, for rigid image-bound representation was not sufficient. Ap- 


FIGURE 3.3. Matrix of jine 
plastic beakers used in the study 
of representation. From Bruner, 
Olver, and Greenfield (1966). 
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FIGURE 3.4. Performance of children, ten at each age, on three tasks. From Bruner, 


Olver, and Greenfield (1966). 


d of verbal formulation, 
a gave evidence of a 
nsposition succeeded 
duction succeeded 


parently the transposition problem required some kin: 
and this was not common before age seven. The dat: 
scale of sorts: those subjects who succeeded on trar 
on the other two tasks, and those who succeeded on repro! 
On replacement. 

The methods for studying iconic representation are not yet well established, 
and the logic of the above demonstration is not altogether clear because it 
depends on prior knowledge of the requirement for success on each task. If 
the reproduction task requires imagery and imagery is not handled well 
before age five, this argues for imagery at age five. Likewise, success on the 
transposition task is taken as evidence for symbolic processes, but it is con- 
ceivable that a child with good visual imagery, unspoiled by verbal transforma- 
tion, could construct an image of the matrix, tilt it over in imagination, 
and read the location of the glasses off the transposed image. An in- 
dependent check on the imagery is much to be desired and, since imaginal 
representation has been reemphasized by Bruner and others, further work 
will probably tighten up the methodology- 

Performance on this problem can be improved, both in the reproduction 
by pretraining on each of the two dimensions 
separately (Darnell and Bourne, 1970). The pretraining was more help- 
ful to children above average in verbal ability than to those below average, 


ntéiie Neca ability were not evi ri a 
he effects of superior verbal ability w t evident during pretraining 


and the transition phases, 
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since all the subjects succeeded on the simple pretraining tasks. More 
likely, the effect of verbal ability consisted in the application of the com- 
ponents acquired on the pretraining tasks to the matrix problem. 

The development of representation by imagery and then by symbols ap- 
pears again in an experiment by Olson (Bruner, Olver, and Greenfield, 
1966) involving a matching problem with a matrix of light bulbs, some of 
which made a pattern—for example, a T—when lighted (sec Fig. 3.92: 
The subject had a similar matrix of bulbs which were unlighted but could 
be pressed one at a time to light up the bulbs in the pattern of the model. 


The easiest problem presented a single pattern of lights and the subject 


pressed bulbs: to see 1 he coma make a acer tw mech she model. On a 
problems, two or more models were presented and the subject had n E 
cover, by pressing his bulbs, which one he could match. The older chil iei 
did’ better ‘than the young children, to be sure, but the sequences of ri 
attempts permitted inferences about the strategies made possible by the 
additional mastery of images and symbols. 

One strategy was a simple search; without regard to the model, d 
ject pressed bulbs to see what would happen. Some bulbs were pee 
over others, so the search could not be called random, but it was not ja 
to the information in the model. This inefficient strategy—if it can j 
called a strategy—was typical of children of three (see Fig. 3.6). b- 

Another identifiable strategy was a successive pattern matching. The pase 
ject pressed light bulbs suggested by one of the models and tried to re 
out a pattern, without considering alternative patterns in other mo ne ‘ 
This strategy, which was common at ages five and seven, was more efficien 
than the search, but contained many uninformative moves because the 
lights in different patterns overlapped; that is, turning on a light contained ie 
two patterns or in neither does not furnish any discriminative information. 

The most efficient strategy was a more deliberate information selection- 


the sub- 


patterns 


FIGURE 3.5. Apparatus presenting From 


A A i ts. 
of light bulbs for matching experimen 
Bruner, Olver, and Greenfield (1966). 
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FIGURE 3.6. Performance of children, 10 to 15 at each age, using 3 types of strategies. 
From Bien Olver, and Greenfield (1966). 


e nonredundant light bulbs that gave discriminative 


The subject chose ghios the models. The strategy was adopted 


information about the Pad om a fifth of them. 
A ; nly by a 
only by subjects of Hine, A00 OP? i : s 
Gus y a A ren ai he same problems, under a constrained condition that 
her children ha 


permitted them to press only one bulb at a time, after bp on a e 
would ask, “Now do you know which nee a aie tae onon or 
constrained condition was designed E ens deliberate under these condi- 
informative strategy. The children were gene Scruton adiction, Sate 
tions and, as Fig. 3.6 shows, the use of the in a BY 


S ] derably amor and nine-year-olds. ‘The: conclusion 2 
increased considerably ° : 
he subject h 


j T as an image corresponding to the 
that in pattern matching nide his movements. The information selection 
model and uses this tO S iatt conceptualization of the properties that 
pared requires a pn he so that the ns i symbolically 

istineui veen dure! er tha: S 
rep cog ba critical features of the patterns rat n respond to 
resent separate 
the images as wholes. s fluently to represent familiar objects, 
Even when children a Š pile by perceptual rather than abstract 
Olver and Hornsby (Bruner, Olver, 
illustrated this point by a simple procedure ap- 
2 ubject is shown the words banana and 
iate ixton are 
propriate for ages oe "re banana, peach, and potato all alike?” Words are 
HE and oie i ts asked until the list consists of banana, peach, 
adde imi ues 
and simi an g ner Bi and germ. These words, and those of other 
potato, meat, milk, wate" tual similarities (“they are both yellow”), b 
lists, can be grouped by perceP noy 


seven- 
ng se 


and Greenfield, 1966) 
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functional similarities (“you can eat them”), and so on, but perhaps more 
i . As the 
perceptual grouping gives way to the functional, with increasing age, the 
complexive structure gives way to the superordinate. A superordinate struc- 
ture puts the items in a higher class. Complexive structures are tied together 
by associations or varving similarities but without a consistent common 
attribute: “Banana and peach are both yellow, peach and potato are round, 
potato and meat are served together, meat and milk both come from cows.” 
The longer lists are harder to group in one superordinate class, of course, so 


interesting is an analy 


of the structure or syntax of the grouping 


around age twelve most subjects formed superordinate groupings for all 
easy lists but complexive groupings for many of the hard ones. Similar 
studies with pictures to be grouped tell the same story, The six-year-old 
sees equivalence in perceptual terms, perhaps with the aid of imagery. With 
the development of symbolic representation in the older child, according 
to this account, linguistic structures increasingly guide what and how things 
will be judged alike. i 

It is difficult, of course, to validate the exact contribution of language to 
this developmental process bevond the possibility of alternative interpreta- 
tions. Piaget (1967), in his turn, has taken up the weapon of methodological 
criticism to locate errors in results that are inconsistent with the Genevan 
results, especially on the role of language. With all the instabilities that have 
turned up during the transitional period, however, and all the language 
that children are exposed to at this time, there is plenty of opportunity 


for linguistic acquisitions to slant cognitive development to a significant 
degree. 


Language and Thought 


The language-thought problem, like the mind-bodv problem, has changed 
over the centuries of controversy, but in the annals of speculative psychology 
certain questions persist CCarroll, 1964a; Jenkins, 1969; Révész, 1954). Is 
thought possible without language? Is la 
Are language and thought the same? 
chologists who have explored such 
closely related to his language, 
other symbols. But the relation 
viewed from several angles, he 


nguage possible without thought? 
The many philosophers and psy" 
questions agree that man’s thought is 
that is, to his use of a system of words and 
ship between language and thought can be 
i nee a clarification of the question is part © 
any answer. In addition, the Progress of psychology has produced relevant 
empirical evidence (some of which has been mentioned on preceding pages) 
and there are fresh ways of viewing the problem. However, since many of 
the arguments about language and thought still sound like comparisons of 
two reified mental faculties, the discussion needs first to be placed in a 
biological context, along with other Psychological functions. 

al organization that different functions 
not completely independent nor completely redundant, but interdependent 


It is the general rule of biologic: are 
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and overlapping. The nervous system and the circulatory system work 
together to integrate the activities of the organism, as in the control of 
activity level; there is some overlap in function but they are not redundant 
systems. Similarly in the normal person, vision and audition function inter- 
dependently to supply information about the environment; each helps the 
other in certain tasks, such as the localization of objects, but not in all. If 
one system is inoperative, the other can take over some of its duties but 
not all. Language and thought are more recent, highly specialized develop- 
but by this analogy we would expect a 


ments in the evolution of man, 
nonredundant, nonparallel, and nonin- 


similar overlapping organization, 
dependent. We would look for the development of interactions between 
thought and language because both are functions of one living organism. 
Many who have struggled with the language-thought question have com- 
plained about the shifting usage of each term. The nature of thought and 
its various meanings—mental experience, problem solving, mediation—were 
discussed in Chapter 1. The nature of language is also complex, but a few 
Features are especially relevant to the present discussion. In the normal adult, 
the function of language includes communication between individuals. Con- 
tent words, like nouns and verbs, that refer to objects and events are obvious 
features of language; but function words, like prepositions and conjunctions, 
that do not refer to anything but relate words to other words must also be 
included, Above all, language is characterized by structures, such as phrases 
and sentences, that are more than lists of random series of words, yet a 
great variety of sentences can be generated from a few basic structures by 
standard transformations. Thus language involves a skilled performance by 
the speaker Cor writer) and a skilled performance by the listener Cor reader), 
who share a competence in the standard linguistic forms underlying the 
specific performances. 


If we consider thought in the sense 
introspection as the method of inquiry. By this method it appears that our 


thoughts consist often of words and other linguistic forms. But introspection 
also reveals nonlinguistic thoughts, such as smells, visual images, and in- 
effable fears, so thought and language cannot be equated by this method. 
Introspection is not a convincing guide. And yet, if we wish to explore mental 
experience at all, we must admit that both linguistic and nonlinguistic 
thoughts are frequently experienced and we cannot say that either class 


of mental experience, we must trust 


of thoughts is unthinkable. 
If we consider thought as problem solving, 
problems that man has to think about are verbal problems: achievement of 


the concepts and conceptual systems of the arts and sciences, reading and 
evaluation of abstract arguments, construction of effective sentences, organi- 
zation of paragraphs, comparison of alternative sentences, and debates 
with worthy opponents. All these problems appear in verbal form; thus their 
solution requires the ability to read and write, to listen and talk. There are 
other problems, of course, that do not appear as verbal problems: the dis- 
crimination of figure classes, aS well as arithmetic, spatial, construction, and 


it is obvious that many of the 
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DEVELOPMENTAL INTERACTIONS 


The interaction between thought and language must be a dynamic one. 
As the child grows older and more intelligent and acquires more of the 
adult culture, he becomes more sophisticated in both functions. The per- 
sistent ]anguage-thought problem can be raised again in the form of a ques- 
tion about the sequence of development: Does language develop first and 
influence the development of thought, or does thought develop first and in- 
fluence the development of language? 

As soon as the young child starts to respond to adult voices and to direct 
his babbling toward adults, we can say that language is developing. Some 
utterances at this time may be considered attempts to solve a problem, such 
as retrieving a fallen toy, and these attempts are not much different from 
a child’s tug on his mother’s arm or a dog barking at a closed door. If 
thought means problem solving, this is thought of a simple childlike kind. 
But there is no reason at this time to infer that the language is influenced 
by cognitive mediators, internal representations of the environment, OT 
conceptual thought. Reaction to adult voices at this time is similarly direct, 
not mediated. This direct, interpersonal, linguistic performance might be 
called preconceptual language or, if thought means cognitive mediators, 
language without thought. 

The most extraordinary phenomenon—one that has puzzled both linguists 
and psychologists—is that normal children acquire the basic structures of 
sil native language by age four or five, when they are characterized by 

laget as preoperational and before they can solve simple problems like 
tieing their shoelaces. Even children of low intelligence, with IQs of between 
50 and 75, learn the language moderately well. This is not to say that their 
language includes words for i 
construct sentences according to rules so complex that linguists have not yet 
worked them out satisfactorily. Nor does this mean that children can state 
the rules for generating sentences, only that they are capable of complex 
linguistic performance. Such performance gives evidence of the development 
of language skills without corresponding ‘development of thought. 

As we noted in the section on cognitive development, there is reason tO 
p young child acquires some simple cognitive structures for 
representing what he sees and feels before he learns to refer to them by 
name. These structures are limited in the early years to concrete, perceptual 
schemas, but they act as mediators in facilitating adaptive responses to dif- 
ferent views of the same object on successive PAE and to classes of 
perceived objectives. The role of such mediators is difficult to evaluate, but 
if they are operative at this early age, this operation could be considered 
prelinguistic thought. s 

Thus, if these accounts are accurate, in the first few years of life the 
child develops a language facility and a repertoire of cognitive mediators 
to represent his world—perhaps to think about his world—but the two are 
distinct. As to individual differences, correlations between the conventional 


r . e y 
abstract concepts and principles, but that they 
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intelligence tests (supposed to measure cognitive accomplishments) and 
the conventional measures of language skill, such as length and complexity of 
sentences, are quite low during these early years (Furth, 1966). The next 
question is how the child integrates these two functions so as to develop 
the relatively smooth interactions that are observed in the normal adult. 


VERBAL FACILITATION OF INTELLECTUAL DEVELOPMENT 

The block-sorting test described on page 97 was designed by Vygotsky 
(1962) in the 1930s to study the intellectual progress of Russian children. 
Unlike conventional figure concept tasks, this task begins with a word for 
a block, and thus a symbol can orient the conceptualization of a class of 
blocks. Learning to direct one’s own mental processes with the aid of words or 
other symbols is an integral part of the process of concept formation, according 
to Vygotsky. , , 

The role played by speech in intellectual development is dramatized by 
the egocentric speech of the child, which has been considered the beginning 
of socialized speech. As Vygotsky observed the performance of children of 
different ages on the block test, he rejected this sequence and asserted that 
socialized speech comes first. Egocentric speech splits off from general social 
speech, and this split leads to inner speech, which now has a mediating 
rather than a social function. All the higher functions are mediated processes, 
according to this view, and language is the basic means used to direct and 
control them. Words are the means of actively centering attention, abstracting 


attributes of objects, synthesizing them, and symbolizing them. 
's view, the word is not an arbitrary name 


For young children, in Vygotsky 
for a class of objects; it is an attribute of the objects, like size, shape, and 
color. Only later, at adolescence, is this verbal absolutism replaced by the 
more formal, adult type of meaning, and even then, after true concepts are 
formed, the elementary forms of thought are not completely abandoned. 
Thus the relation between words and objects is a dynamic one, changing 


with the child’s maturity, experience, and education. Progress in thought ze 
Progress in speech are not ‘parallel. Rather, their growth curves cross an 


recross, In time, the meaning of a word joins thought and language _ closely 
j f a g anguag 
that it is hard to say whether thought depends on language or language 


depe d 
n g. 
pends on thought. in the development of thought has been 
The importance of language in the develop descri o 
emphasized by Russian investigators since Pavlov 3 me flexes of the low 
Signal system. Pavlov considered the ordinary conditioned reflexes o i he ower 
\ not adequate to explain the complex behavior 
a signal of a signal. The appearance of a light 
a signa g : 
word light can be a signal for the 
hus as language introduces 


ption of the second 


animals as a first signal system, 1 
of man for whom a word can be 
can be a signal for food, but in man the 


light which is in turn the signal for the food. TI deien A 
Words for objects, he responds less often to the objects directly and more often 


to the verbal signals of objects. 
Luria (1961) and his students, who hav 
of behavior in Russia for many years, bega 


se been studying this verbal control 
n with Vygotsky's assumption that 
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the most important mental activities result from the child's social cy. 
in the course of which language develops and acquires control over be na 
An adult draws attention to an object, for example, by pointing to pow 
this object becomes the controlling stimulus for a response. If at the a 
time he names the object, the name gradually acquires the power to — 
the same response. Later the child himself pronounces the nime gee 
the response. Thus complex mental activities, bound up with speech, 5 i 
initially carried on slowly and only by external audible communication u 
adults: but they gradually speed up as they are interiorized in the form 0 
thought rather than speech. o — 
To demonstrate this development, Luria arranged conditioning expentiney : 
with a simple hand movement, such as squeezing a rubber bulb, as tbe A 
sponse, and lights and words as stimuli. Children in the second year can ba 
conditioned to squeeze when a light appears, but more complex tasks requ 
extra verbal instructions from the experimenter. If the task is to were 
relax, then squeeze again, the performance is facilitated by telling the chi r 
to say, “Push, Push.” If the child of three or four is conditioned to ait to 
a red light, the presentation of a yellow light may evoke a different response 
“Shall I press for that one too?” Following appropriate instructions he me A 
a rule, such as: “I am to press for a red light and not for a yellow ae 
The younger children state these rules aloud while the older ehnldeen se 3 
do not, but even older children are occasionally heard to say “No, No” a m : 
in order to inhibit a well-practiced response. Whereas associations ae 
gradually in animals and very young children, by age four or five they ear 
be formed at once, even if fairly complex, by the incorporation of the giv 
signals into an existing verbal system. jaf 
Thus language facilitates thought as the instructions for the — 3 
behavior are understood and acted upon, then as they are repeated a oe 
and then finally as they mediate behavior soundlessly. In the third Ao 
Luria believes, as the control is transferred from external to internal T 
the regulatory function is transferred to the system of meaningful p 
provided by speech. The analytic flexibility that derives from the abstracting 
and generalizing functions of language becomes the property of thought. ” 
American behaviorists also have adapted the machinery of amalu 4 
associationism to the investigation of verbal facilitation of intellectual deve p 
ment. According to the Kendlers’ hypothesis about verbal mediation outlin Is 
in Chapter 2, children learn words in ordinary communication, then the wales 
mediate the acquisition of class concepts. While the child of five et 
directly to objects, the first response of a child of ten may be an impit 


j A i asses 
verbal response which, because it has previously been associated with cla 
of objects, evokes behavior appropriate to classes. 


NONVERBAL COGNITIVE STRUCTURES ; aint 

Those who have studied cognitive development seriously are likely to pe 5 
out that the early cognitive structures influence intellectual Se OEE a 
general and facilitate the later acquisition of words for these structures. Piag 
g 
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in particular has reported many examples to illustrate how the young child 
at the sensorimotor level represents objects and events by movements of his 
hands and feet, sometimes even by playful imitation of actions that interest 
him. The sensorimotor schemas that control such behavior can be considered 
mediators in a sense—though cognitive theorists do not typically use this 
term—but they are private, idiosyncratic representations, not verbal mediators 
at this stage. j 

In Piaget's (1954) discussion of language and thought from the genetic 
point of view, sensorimotor schemas confer significance of a kind on objects 
and events but only as indicators or signals of them, just as, in the conditioned 
reflex, the conditioned stimulus is a signal for the unconditioned stimulus. 
They are the preconditions, along with images however, of the true symbolic 
function, which distinguishes the signifier from that which is signified and 
thus permits thought of an object even in its absence. Socialized language, 
which develops later on the basis of these early types of cognition and inter- 
action with adults, is only one kind of symbolic function; but it transforms 
thought profoundly by facilitating the attainment of thoroughly abstract 
schemas that can be used more flexibly. When this level of intellectual ma- 
turity is reached, according to Piaget, asking whether thought precedes 
language or language precedes thought is as hopeless as asking whether the 
river controls the banks or the banks the river. But earlier, before age three 
or so, there is supposed to be a nonverbal state of cognitive development that 
precedes and makes possible the later acquisition of language. Each child has 
to construct reality for himself, and his constructions guide his appropriation 
of the language that he hears. 

Specific illustrations in support of this position have been supplied by 
Sinclair-de-Zwart (1969), working with Piaget and Inhelder at Geneva, from 
her experiments on the spontaneous use of quantitative terms by children 
who have achieved conservation of liquids and seriation and children who 
have not. In describing different quantities of Plasticine, the conservers used 
mostly comparative expressions: “The boy has more than the girl.” The 


a. 
nonconservers used absolute terms in the same situation: “The boy has a lot. 


i i Si ere able to construct sentences that co- 
The girl has a little.” Conservers were abl 


ordinated two dimensions of objects: “This pencil is long but thin; the other 
is short but thick.” Most of the nonconservers referred to only one dimension 
in each sentence, Verbal training for the nonconservers produced some im- 
provement in their verbal descriptions, but this was unstable and did not help 


them to acquire conservation. 


Bruner (Bruner, Olver, and Greenfield, 1966) follows Piaget in recent 


speculations on this question but adds the vo a of Vygotsky on others 
A ment more intima 

to link cognitive development and language deve T PO welt tely. The 
link is the referential function, by virtue of Be Yi 5 refer to other 
things, for this is the basis of both language an i “pe ee OBR beyond 
enactive and iconic representation requires some social training in the use of 
language. Not that language alone can mold the raw dough of reality; some 

BUA ge. NOE tab sapr o, y. But when perceptual diff i 

degree of cognitive organization 1S necessary. P erentia- 


126 ¥ COGNITION AND CONCEPTS CONTINUED 


tions are taking shape, words can reorder cognition by only a gentle nudge. 
The availability of terms like large and small, for example, can nudge cogni- 
tion of magnitudes into opposing categories; at a later time the availability of 
a term like length can draw attention to the continuous dimension. 

As it happens, syntactic development outruns semantic development, ac- 
cording to this account, so that at age five children are quite skillful at 
organizing sentences but less skillful at conceptualizing the things the words 
and sentences refer to. Their language has more potentialities than thev 
can utilize at this time, but with experience and social interaction their 
sometimes empty sentences will fill up with meaning. The continuous ad- 
justment of speech to thought and thought to speech in social interaction 
sharpens both and enables the older child to solve the developmental prob- 
lem raised by Piaget, which can be phrased as the child's matching of his 
extensional classifications to his intensional sentences. 

Although many scholars have stated that words slant thought, Bruner 
adds that the more complex structures of langu 
slant thought in more complex ways. Once the child has mastered the 
language of his culture and has coded his own experience in this language, 
the syntactic rules may endow his sentences with a surplus meaning that 
he does not at first understand. It is when a child listens to what he says. 
as he has listened to what others say, that he reorganizes what he thinks. 
Bruner’s position on this critical issue is revealed in the following paragraph: 


age, such as the sentence, 


Until the time that this “surplus meaning” is read off from our linguistic 
coding of experience, language and experience maintain an important inde- 
pendence from each other. A child can say of two quantities that one is greater 
than another, a moment later that it is less than the other, and then that 
they are the same—using his words as labels for segments of experience. It 7 
not until he inspects his language that he goes back to his experience to chec 
on a mismatch between what he sees with his eyes and what he has just saie- 
He must, in short, treat the utterance as a sentence and recognize contradict 
at that level. He can then go back and reorder experience, literally see the wor 


pat y virtue of symbolic processes reordering the nature of experience 
pp. 51-52]. 


THE SPECIAL CASE OF DEAF CHILDREN Beate 
The intellectual performance of children who have been deaf since 
birth, or shortly thereafter, is particularly relevant to the question of lan- 
: . - ative 

guage and thought since their deafness deprives them of the cumulativ 
practice in listening and talking enjoved by normal children during their 
preschool years. With special remedia a a little 
language by age sixtecen, but in car 
of this world as best they can w 


dial training they may acquire wl 
| adie a problems 
ly childhood they adjust to the proble 


: 2 z th 
ithout the aid of language. Oléron (1957) 
therefore made several comparisons between the 


children aged four to seven and deaf dulas 
roughly in scores on intelligence tests. 
studied bv requiring the children to 


i na 
accomplishments of nori 


squated 
of the same ages equat 


“as 
Problem-solving performance W 


manipulate various combinations 
age 
latches and bolts to open a problem box and obtain a bonbon. Languag 


of 
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might or might not be helpful on this task; in fact, the deaf children 
without language did as well as the normal children. 

Even simple laboratory problems can be solved in more than one way; 
the important question is whether deaf children solve problems with 
mediators which normal children designate by words. To investigate this, 
Oléron turned to a task often employed in studies of discriminative learning: 
a pair of squares is presented and the child is rewarded for choosing the 
larger. Then, on the transposition task, another pair is presented to see if 
the child chooses the larger of these. Typically, young children make an 
absolute response, choosing the stimulus previously rewarded, while older 
children make a relational response, presumably because they are familiar 
with the word larger. The question is whether deaf children, who do not 
lations from a concrete situation and 
Another task involved two cylinders 
varying in weight to determine whether children can acquire and transfer 
the relation of heavier when they do not know the word heavier. A third 
task involved the relation of faster. Children learned to choose the faster 
of two rotating discs; then the transfer tasks involved two lights of varying 
speeds of alternation and two miniature trucks descending an incline at vary- 
ing speeds. Even though the normal children used words like larger, heavier, 
and faster, which the deaf children did not use, there was little difference 


in the performance of the normal and the deaf. The older children were 
but there was no consistent 


use such words 
apply them to a different situation. 


acquire abstract re 


superior to the younger children, of course, 
difference in rates of development. 

Another type of task in Oléron’s experiments depended on mastering 
the rules for series of choices. The double alternation task required choosing 
one of two squares according to the sequence L, L, BR, R, L, L, RB, RB, and 
So on; triple alternation involved the sequence L, L, L, R, B RL LL, 
and so on: and there was a series of eight boxes to be opened according to 
simple rules about spatial-temporal sequences. When children have to gen- 
erate rules from experience and apply them to later tasks one would ex- 
ld be helpful, but in fact the deaf children again 
l as the normal children. There are methodological 
deaf and normal children for these com- 
parisons, but evidently there were no large differences in performance. The 
results indicate that children without language can handle certain tasks that 
presumably involve the mediation of relations, rules, and other simple cog- 
nitive structures about as well as children with language. 

The special conceptual difficulties of the deaf come to light on tasks 
that involve word meanings, as in an experiment requiring extension of the 
meaning of a word (Oléron, Gumusyan, and Moulinou, 1966). Deaf 
children of five, six and seven were shown common words in association 
with pictures of corresponding objects and events. When different pictures, 
Tepresenting positive and negative instances of the classes designated by the 
words, were presented, the deaf children were unable to generalize 65 the 


pect that language wou 
performed about as wel 
difficulties in equating groups of 
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new positive instances as well as comparable children with normal hear- 
ing. At age five there was little difference in performance on this task, 
but at age six and seven normal children were quite successful, while the 
deaf children’s performance was characterized by many errors of nongen- 
eralization and a few errors of false gencralization. The authors explain 
that what the deaf lack is not just the direct connection between a word 


and its referents but the complex tissue of relations between objects and 
their properties which language makes available to the normal child dur- 
ing these vears. The deaf do not participate fully in the linguistic milicu. 

A hopeful implication can be gleaned from these results, however. The 
experiment consisted of three series of instances; in the second, errors were 
corrected. During the course of the experiment the deaf children, even the 
five-vear-olds, improved markedly. They did not improve as much as the 
normal children, to be sure, but their performance on the last series was 
much superior to that of the normal children on the first series. Obviously 
they were able to broaden their comprehe 
set of semantic exercises. 

Despite their severe language handicap, the deaf do get along. According 
to the evidence assembled in a recent book by Furth (1966), with the 
provocative title Thinking Without Language, average achievement in 
silent reading at age sixteen is only at the level of normal children in 
the third or fourth grade; and yet adults who have been deaf all their lives 
do solve the routine problems of daily existence. Although they do not get 
into the professional occupations, as some blind people do, they hold steady 
jobs, typically in the low-pay, low-status classifications. Hence Furth raised 
the question of how they can make as good an adjustment as they do. When 
he compared the deaf and the normal, 
results with American children th 


nsion of common words by a short 


he found approximately the same 
at Oléron had found with French chil- 
dren. On such tasks as reversal shift and double alternation, which are pre- 
sumed to require verbal mediation, the deaf children were about equal to 
their agemates of normal hearing. Although normal children at seven are 
learning words that could serve as verbal mediators on many conceptual 
tasks, no noticeable gap between the normal and the deaf appeared at this 
age. But a difference did show up on certain tasks, such as one involving the 
abstract relational Concept of opposition, on which the normal children could 


be aided by knowledge of a specific word, opposite, that the deaf children 
did not know. 


The conclusion from these co 
acquire certain cognitive struc 
lations closely tied to visible 
other problems of the same ty 


mparisons is that young deaf children can 
tures—specifically, spatial and temporal p 
objects—and apply them to the solution © 
pe. If we define concepts to include the asso- 
ciation of the cognitive structure with the standard label used in communi- 


cation, these are not complete concepts, but they are cognitive structures 
that mediate or control behavior. As these children grow older and en- 
counter problems that depend on more abstract lans their handicap 
becomes more serious because the more abstract concepts that normal 
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children are achieving at age ten are not tied to visible objects and cannot 
be learned without good verbal communication. The deaf adult must have 
acquired the cognitive structures that he needs for his low-level adjustment, 
but he acquired them slowly, with great difficulty, and without much help 
from verbal mediators. Furth believes that there is, in addition, a general 
discouragement among the deaf due to inadequate teaching methods directed 
toward linguistic rather than cognitive education. For educational practice, 
the fact that cognitive structures can be achieved with the help of verbal 
mediators may not be as relevant as the fact that they can also be acquired 
without such mediators. 

The more general inference is that, if deaf children can acquire certain 
cognitive structures without language, normal children can also. Or they 
can acquire the cognitive structures first and the names for them later. The 
superiority of the deaf in a few comparisons suggests that some normal 
children may be temporarily retarded by premature attempts to use their 
z Until language and thought are appropriately 


inadequate language facility. 
alities for intellectual development will not 


matched, the expanding potenti 
be realized. 


PRESENT STATUS OF THE PROBLEM 

When the arguments are examined carefully, it appears that those who 
say thought depends on language and those who say language depends on 
thought are not too far apart. Both sides give illustrations to show that the 
dependence does not go exclusively in one direction; the differences are 
matters of emphasis and priority. Actually, there is only a little hard evi- 
dence; the arguments typically rest on examples and counterexamples. It 
has been difficult to design convincing experiments because the hypotheses are 
based on theoretical mediators that are not directly observable. Precise 
methods would be necessary to check on differences that are only matters of 
emphasis. 

In spite of discrepant points 0! 
possible to discern some agreement 
thought: 


f view and the difficulties of research it is 
about the development of language and 


f life these two functions operate independently. 


In the first three years 0 : à 
the referential function of 


Word meaning or, in more general terms, 

symbols, is the link that joins language and thought. 

3. Interpersonal communication is critical for conceptual development. 
Deaf children can acquire the more concrete concepts, but they are 
severely handicapped on abstract concepts as well as on silent reading. 
Similarities between objects can be emphasized for both deaf and nor- 

jointing and by spatial grouping, but only normal 


mal children by p : er g 
children can profit by linguistic grouping in a sentence. 
4. The contribution of syntactics to the development of thought is 
different from that of semantics. A word is perceived as a unit, after 


f the flow of speech, and the class of 


pe 


it has been segregated out 0 


y 
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i i i a sentence is a 
objects to which it refers is conceived as a unit; but a sentence i i 
‘eds rada the 
linguistic structure that relates words to other words, restricts 
nl ord: ggests meaning, and makes assertions. As the 
meaning of words, suggests aning, | A 
child listens to his words, he also listens to his sentences—approximé z 
: Bais eaten dis- 
as he listens to those of others—and the older child can agree 3 E 
i : ar precedes. logie 
agree with what he has said. Developmentally, grammar precedes E 
SAS : PREA ae 
it supplies a framework within which the older child compares 
assertion with another, acili 
i siat acili- 
5. The development of language and thought is one of mutual fa 
tation, with occasional interference. 
6. Since intellectual development is an 
thought, it can be re 
ment of either. 


interaction between language and 
i iti a ` we d 
tarded by suboptimal conditions for the develoy 


Thought and language are | 
fo) ò d 


i > re state 
arge, abstract categories and the above sta 
ments are large 


generalizations, Obviously, achievement of abstract con 
cepts leans more heavily than achievement of concrete concepts on previous 
linguistic achievement. Depending on the specific concept and the specific 
environment, a child may 


acquire a word and use it conventionally in 
sentence before he uses 


it efficiently in thought and he may use a nt 
cept in solving problems before he has a word for it. The specifics of t 


Ree ‘ his 
development of language and thought make large generalizations about t 
question especially hazardous. 


Major Trends and Implications 


When different people 
them, they do not all star 
more by color than by 
in meaning, are impress 


" ` i9 ind 
attempt to categorize the objects and events arot 
t out the 


Y í ei sssed 
same way. oung children are Impress 
form; olde 


à s t rest 
r children and adults, with more inter 
5 $: Š > age 
representative features. Even at the same 5 
categorical styles, such as biases toward perceptual or abstract attributes, 
R . . . © vg: : ters, i Te. es 
distinguish different individuals. But the large, stable individual differenc 
m concept achievement are correlated with 
ee eee : . “tive 
gence or the more Specific abilities known as general reasoning, inducti a 
reasoning, and verbal comprehension, This intelligence difference is mani 


fested in method as well as final performance; children of below-average 
ability plod along by associative learning w 
5 


testing hypotheses, 
As children grow, 

quantitatively by 

tests, and better 


ARR é al intelli- 
abilities, either general intel 


hile those above average are 
their cumulative cognitive development is manifested 
t concepts, higher scores on vocabulary 
ncept learning experiments. But many 
t this development qualitatively, by de- 
gical operations, representational func- 
asurement wrinkles are ironed out, all 

development are likely to be correlated 
with objective performance data over the Years from two to twenty because 
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any sequence that is intuitively satisfying to a well-inf ` i 
n accurate. No rin ere i apear n rta pen 
before concrete classes ake good inferences < T phe ees 
tween two elie sired e pe T a i 
the methods of measurement, the E f the 6 bey aoe e . 
from one to the next ie os ae Gee oe oe ay nye 
a roversy. As in 
motor development, both maturation and experience are necessary, and a 
more mature child can profit from a small amount of experience a ance 
situations just as a less mature child can be pushed ahead by a heavy dose 
of planned experience. j : 
Jean Piaget, leading the 
schemas of the first two years of life, the concrete-operational structures that 
and the formal abstract structures of the ado- 
orderly maturation of these structures, given 
hers have picked up instabilities 
across structures of vary- 


parade at present, has described the sensorimotor 


develop from five to eight, 
lescent. He has stressed the 
adequate environmental support, while ot 
in the hypothesized sequences and inconsistencies 
ing content. 


in purpose produces a contrast in procedures. Piaget 


The basic contr: 
and his colleagues aim to describe cognitive structures at various levels of 
ination procedures to diagnose 


intellectual maturity and arrange their exami 
American experimental psycholo- 


for converting imported 
ocedures so that practice 
c attempts to accelerate 
and so on 
and 


the child's natural achievements. Many 
sists, on the other hand, possessed by a genius 
ideas into learning experiments, rearrange the pr 
becomes the main independent variable. Systemati 
nt of conservation, seriation, addition of classes, 
by planned practice are intended to clarify the nature of the stages 
especially the processes by which each transition to another stage is achieved. 
And the encouraging results to date, obtained under optimal conditions, are 
beginning to stimulate changes in educational practice. 

Achievement of formal operations includes the use of words and other 
symbols to represent reality. As Jerome Bruner tells this part of the story 
iconic representation, making use of imagery, 
and enactive or motoric representation pre- 
ation from verbal has al- 


the developme 


of cognitive development, 
precedes verbal representation, 
cedes iconic, Distinguishing ima 
ways been troublesome, but the re 


ginal represent: 
al of interest in imagery Cin several 
branches of psychology) will stimulate the invention of better methods for 
doing this. The methodological possibilities originate in the fundamental 
differences: iconic representation is extended in space and has some per- 
ceptual correspondence with reality; verbal representation is linear and has 
only a conventional correspondence with reality. The available evidence, 
from experiments with patterns of objects and lights and with words of 
Perceptual and abstract meanings, indicates that iconic representation gives 
way to verbal during the transitional years six and seven. 

Words start arguments. Debate continues on the question whether lan- 
guage depends on thought or thought depends on language. When the 
question refers to adult unctions, however, extreme positions are un- 


viv: 
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tenable and moderate positions indistinguishable. Refering to sual bate 
development, the arguments are more empirical because abundant data can 
be arrayed to support intricate hypotheses about developmental interactions. 
The controversy over verbal facilitation of cognitive development has the 
same ingredients as the controversy over verbal facilitation of the learning 
of figure concepts in the laboratory, 

During the first two or three years of life, language and thought develop 
independently. But evidence has been offered by Russian and American in- 
vestigators to emphasize the role of language in controlling thought as well 
as action from these years onward. Piaget and his followers, on the other 
side, have published many observations and a few e 
the development of the structures of thought is an autonomous develop- 
ment, prerequisite for the acquisition of language, which is ge 
ble and adaptive rather than bound to superficial particul 
his colleagues have made use of the recent insights 
research on perception and language—skipping the 
dence, however—and their own research with childre 
velopmental theory that unites speech to thought. When cognition is 
starting to take shape, the availability of words in the culture slants the 
organization toward certain classes of objects and relations between them. 
At the same time, sentences structure experience and allow the child to 
reflect upon his own assertions as well as the assertions of others. Thus the 
childish thought forms soon become embedded in the more mature forms, 


which are much more convenient and more efficiently represented by the 
language habits of the adult culture. 

Research with deaf children supports the 
development, for concrete perceptu 
time underlines the severe h 
stract achievements th 
cation. Language is t 
even though it is not 

A distant view, 


xperiments to show that 


nuinely flexi- 
ars. Bruner and 
of psycholinguistics, 
verbal mediation evi- 
n to synthesize a de- 


autonomous view of cognitive 
al classes and relations, and at the same 
andicap of the deaf in respect to the more ab- 
at are normally acquired through verbal communi- 


he best approach to the thought of another person, 
the thought itself. 
from above the fields 


of battle, reveals parallel trends in 
the struggle to understand the co 


gnitive development of children and the 


struggle to understand the laboratory acquisition of concepts. In both cases 
the simple maneuvers of the early 


tesearchers have become more compli- 
man beings will do almost anything to 


pre : ging objects and events. Certainly the 
biological maturation of the organism underlies al] development. But once 


the child graduates from discrimination learning to the acceptance of cog 
nitive organization as a personal problem, the psychologist’s task of theory 
building undergoes a corresponding increase in complexity. All the indi- 
vidual’s functional versatility can be trained on the problem: perceptual 
preferences, direction of attention, perceptual differentiation, verbal media- 
tion, strategies of problem solving, and, penetrating at all levels, the linguistic 


offerings of the culture. This does not mean that the development is unlawful, 
but that the laws must be equally versatile. 


cated each year, because flexible hu 
achieve a stable cognition of chan 
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Main Trends and Implications 


hinking characterizes human activity be- 
cause problems characterize the human 
condition: a problem is a special kind of 
interaction between a person and his en- 
~ Hence the first section provides an orienta- 


oblem solving as a topic for psychological 
shall examine a few task variables, 


that have a widespread influence on 
blems in general. But what most peo- 
vant to know about problem solving is the “inside 
the unseen operations by which solutions are 
Hence the bulk of this chapter is allotted to 
ving processes. The next chapter will take up 
d individual differences in problem solving. 


vironment 
tion to pr 
inquiry. Then we 
such as complexity, 
the difficulty of pro 
ple v 
story,” 
reached. 
problem-sol 
training an 


Orientation 


As an attempt at 
arises when a perso 


definition, we may say that a problem 
n is motivated toward a goal and his 
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first attempt to reach it is unrewarding. The motivation may be hunger, 
thirst, sex, achievement, fear, or any other external motive, of course, but 
it may also be a more subtle intellectual motivation, such as curiosity or 
interest in the problem for its own sake. Many people like to manipulate 
puzzles, words, or colors, just for the fun of it. And many creative thinkers 
believe that this intrinsic motivation, this love of the activity itself, is more 
effective than motivation toward an extrinsic goal. 

Frequently the motivation for problem solving is aroused by a question. 
Where is Zambia? Why is intermittent reinforcement so effective?) Who 
am I? 

If the individual, motivated by the demands of the flesh or the spirit, 
reaches his goal on the first try, he has no problem. If he does not reach 
his goal but sees that he is progressing toward it, as when he is physically 
walking toward it or doing something that seems to be bringing it closer 
to him, he feels rewarded, and we may say again that he has no prob- 
lem. But if his first response does not seem to do him any good and if his 
motivation persists, we say that he has a problem. 

In what situations does this occur? Certainly not in those that are 
thoroughly familiar. In familiar or partially familiar situations, previously 
learned habits may transfer so well that the individual reaches his goal on 
the first try or believes that he is making progress toward it. It is in novel 
situations, of course, that problems arise. The individual docs not know 
the path to the goal, the answer to the question, the computations to per 
form, the switch that turns on the light, or, in general, how to get what 
he wants. If the knowledge based on previous learning is inadequate and if 


the path to the goal is not directly perceived, the first response will not be 
rewarding; if the motivation keeps him going, we describe his activity as 
problem solving. 


CHARACTERISTICS OF PROBLEM-SOLVING ACTIVITY 


If problems and problem solving are defined in this way, then problem- 
solving behavior will be characterized, like other motivated behavior, by 
C1) goal orientation and continuity of action toward that goal; and (2) 
change of activity after the goal is attained. Special characteristics, in COn- 
trast to routine activity, are (3) intraindividual variability, because the in- 
dividual makes diverse attempts at solution; (4) interindividual variability, 
because even the first attempts of different individuals are seldom the same: 
and (5) time required, because problem solving takes longer than the exe 
cution of a previously learned response pattern of comparable complexity- 
Furthermore, (6) the assumption of mediating activities is plausible. These 
mediating activities are assumed to be responsible for the longer time and 
the greater variability, and are particularly important for theoretical under- 
standing. If we may judge from the contexts in which the words think- 
ing and thought occur, they frequently refer to the time required CL had 
to stop and think”) and to the hypothesis of mediating activities. The in- 


troductory chapter noted that thought often has the meaning of mediation 
D 
and often also of problem solving. 
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It follows from all this that the shift from routine activity to problem- 
solving activity may result from the individual's perception of a physical 
barrier between him and his goal or, in the case of an abstract problem, 
from knowledge that he is not progressing toward a solution, Similarly, 
the reverse shift from problem-solving activity back to routine activity may 
result from attainment of a tangible reward, like food or money, or from 
the knowledge that he has produced an acceptable solution. Certainly in 
solving many problems with well-defined solutions—such as anagrams and 
construction puzzles—the individual stops, not when he is given a tangible 
reward, but when he decides that he has reached the solution. To the in- 
dividual who is inspired by intrinsic motivation, the knowledge that he has 
solved the problem is his reward. Until this occurs his attempts are unreward- 
ing and, given adequate motivation, his problem-solving activity continues. 
This broad definition of problem solving includes deductive reasoning, 
productive thinking, and judgment; but these activities show special charac- 


teristics that justify treatment in separate chapters. Concept attainment under 


certain conditions would also be embraced by this definition, but this class 


of problems too has special characteristics, noted in previous chapters. 


PROBLEM SOLVING AND LEARNING 

oblem solving differs from the performance of a previously 
is not to deny the importance of previous learn- 
thoroughly understood without clarifying 


To say that pr 
learned response pattern 
ing. Problem solving cannot be 


the contributions of such previous learning. lvi dl 
. 4 $ 
The old question about the relation between problem solving and learn 


ing is partly about the use of equivalent versus related but Tepee 
terms. The term reasoning has been used in the past as equivalent to prob- 
associative learning. This usage seems to be 


lem salvino in contrast to 
solving in contrast tc ; à 
j v in a more technical sense 


A pean re ee toda 
going out of style since reasoning is used sat Pact ehmetiocachieve: 
Cto be dealt with in Chapter 6). But in referring to tests of arithmeti 
ment, it is appropriate to speak of arithmetic reasoning in contrast to 

, S a ¢ 


‘ N kes of the nature of genuine 
computational skill, since the former partakes s 


Tea i 
ason Ing. 


‘ ; ing strengthenin 
When the term learning is given the narrow meaning of a streng g 


i ion becomes one 
of associations through rewarded practice, then the question 
D such associative learning. But when 


of differentiating problem solving from sspe : 

learning is broadly conceived to cover all medika of behavior, ened 
stion becomes one of differ- 

Solvi st be i J. and here the que 

olving must be included, ar 


learning. 
entiating problem solving from all other fom yh - s PaE a 
The terminological question has been avoided by referring to simple an 
Ə 


3 siig answer may suffice for pre- 
complex behavior. This superficially tennant vi tee distincti pS 
ut it does not give problem solving the distinction it 


limi ion, b 

nary exploration : ; s 

a a solving and learning are usually involved in com- 
ò 


merits. Both problem s 
plex human behavior. 
Actually, there is so 
problem solving and lear 
solving, and thinking wh 


me broad agreement in the distinction between 

ning. Duncan (1959) has pointed out that problem 
o š i 

en it means problem solving, differs from other 
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forms of learning chiefly in the emphasis on the discovery of the correct 
response. Experiments directed toward the processes by which the correct 
response is produced the first time are different from experiments directed 
toward an increase with repetition in efficiency of performance. 

Gagné’s (1964) more detailed answer to this question is worth quoting here: 


There is a striking difference which can perhaps best be summarized in this way: 
the learning situation for problem solving never includes performances which 
could, by simple summation, constitute the criterion performance. In conditioning 
and trial-and-error learning, the performance finally exhibited (blinking an eye, 
or tracing a path) occurs as a part of the learning 
the syllables or words to be learned are included 
concept learning, however, this is not always so, 
resemblance to problem solving in this respect. Although mediation ex- 
periments may present a concept during learning which is later a part of the 
criterion performance, many concept learning experiments do not use this pro- 
cedure. Instead they require the S to respond with a performance scored in a 
way which was not directly given in learning (the stating of an abstraction 
such as “round” or “long and rectangular.”) ‘Similarly, the “solution” of the 
problem is not presented within the learning situation for problem solving. Con- 
cept formation and problem solving are nonreproductive types of learning [p. 31 1). 


situation. In verbal learning, 
in the learning situation. In 
and there is consequently a 


As a result of such agreement in emphasis on the first performance of the 
solution it is not necessary to review experiments in which the terminal 
performance or solution is practiced. But the dependence of a novel solu- 
tion on previously learned facts, habits, concepts, skills, and other “sub- 


ordinate capabilities,” as Gagné calls them 


, is important for problem-solving 
research. 


Problem solving depends on previous learning—and goes beyond it. The 
dependence on previous learning seems to be the easier part to study, and 
many Investigators assume that the study of problem solving should there- 
fore begin at this point. This assumption is historically understandable, but 
it may be the kind of historical trend that impedes progress. As one experi- 
enced researcher (T. Kendler, 1964a) has pointed out, the question whether 
the S-R unit, so useful in the study of simple learning, is suitable for the 
study of problem solving remains open, Considerable progress has been made, 
as we shall see, by taking previous learning for granted and concentrating 
pe si ia 8 variables in the present situation. And some of the investiga- 
tions to be reviewed demonstrate how problem-solving research is contributing 
to a more thorough understanding of learning. As one would expect from 
the preceding chapters, it is the cognitive teost who have insisted most 
strongly on differentiating problem solving, or thinking, from learning; while 
the association theorists maintain the hope that ithe srinciples of learn 
ing will explain problem solving also, j } 5 

CLASSIFICATION OF PROBLEMS 


Students of problem solving, overwhelm 
man encounters, have often yearned for a 
which problems could be arranged and co 


ed by the variety of problems a 
tidy little classification system PY 
mpared in orderly fashion. There 
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is no standard system because the criteria of classification vary widely, de- 
pending on the purpose of the comparison. But there are a few differences 
between problems that researchers have found useful, at least in preliminary 
analysis. 

Obviously problems differ in difficulty. The proportion of failures and the 
average solution time can easily be recorded, so that problems can be de- 
scribed as “easy,” “difficult,” and so on, for a specified sample of subjects. 

Problems differ, at the outset, in the materials presented to the subjects. 
They may involve verbal, numerical, or spatial materials, that are concrete or 
abstract, ‘representative or nonrepresentative, personal or impersonal, arti- 
ficial or meaningful. Such distinctions are useful, but they are not permanent 
because the materials can often be converted to another form. Verbal prob- 
to numerical or spatial relations. Abstract 
problems become more concrete as algorithms or rules for solution are applied. 
videly used in problem-solving research, ap- 
pears to be one of rearranging a meaningless set of letters, but the research 
shows that the meaning of the solution word is one of the most important 
variables. 

Problems can be described also in terms of the processes by which they are 
solved: judging which of two lines is longer obviously is a different process 
from writing a limerick. Some problems seem to depend largely on the col- 
lection of relevant information by exploratory procedures; others, such as 
reasoning problems, present all the information, so that the solution consists 
of fully grasping the information in view. Psychological processes are matters 
of inference, of course; all problem solving requires certain basic nee 
anding of the instructions and full attention to the 
s way can only be tentative. But 
asize one process or another, as 
f the relevant attributes 


lems are frequently transformed 


The anagram problem, which is v 


such as a proper underst 
task—hence classification of problems in thi 
problems can be constructed so as to emph 
concept problems have emphasized the identification © 
or of the rule of combination 
apes . j lems offers sti 
Factor analysis of the abilities required for solving problems s still 


another method for comparing problems. — ue ae 
cal approach, which has resulted in abilities that rep i" 


F " is 2 ration of the form: 
abilities that represent processes. The ideal is en ¥ requires a certain 
= ñ at success on Pro 5 
aA + i 5 ras aster ines i of ability B, and so on. 

amount, a, of ability plus a certa ‘ 
k Degas F marized at the end of 
Some of the outcomes of this approach will be sum 


Chapter 5. 


This is a sophisticated statisti- 


DEPENDENT VARIABLES 
In the analysis of problem solving m 
‘and many theories proposed, but the b 
n dependent variables: jie most common dependent variable, and prob- 
requency of solution is the m? tured so that one solution can, be iden- 


lems nufac 
are usually chosen or ma ; 
tified as the ede oné: Occasionally, problems with two or three correct 


any conditions have been manipulated 
asic data come from records of only a 
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solutions have been presented and the frequency of each solution innovies 
(Research on problems with many solutions that must be rated for quality 
is considered later, Chapter 7, under productive thinking.) ie 

Time to solution, or latency, is also a common dependent variable. Typi- 
cally, time and frequency of failure are well correlated; but there are n 
sional exceptions because some people emphasize accuracy, at the expense 0 
time, more than others. 

Number of moves is a feasible measure when the moves are observable 
and discrete. 

Number of hints necessary for solution c 


an be recorded if the problem is 
difficult. This is a rather unsatisf. 


actory measure, however, because hints are 
not standard units and comparisons are uncertain. 

Verbal statements of the subject 
by some experimenters. These 
culties, generalization of a prince 
and explanations of phenomena. 

Some complex problems permit the calcul 
as the sequence of moves or the 
interest, but the ones listed 
from each of these depende: 
follow. 


are often recorded and are much employed 
consist most often of comments about difh- 
iple, description of problem-solving methods, 


ation of derived measures, such 
percentage of certain moves of theoretical 
are the most common. The evidence obtained 
nt variables will be illustrated in the pages to 


General Task Variables 


If we look at the materials that face 
see words, sentences, anagrams, 
switches, and many other kinds 
make a difference in the problem- 
still possible to Pick out a few v 
tion of many problems of varyi 
interesting in their own right 
needed for inferences about th 


people in the form of problems, we 
numbers, pictures, blocks, strings, lights, 
of materials. The problem content does 
solving activities, as we shall see, but it is 
ariables that influence the difficulty of solu- 
ng content. These general task variables are 
and also because they furnish the basic data 
e problem-solving processes. 

COMPLEXITY 


Complex problems are mor 
as anyone who has played 
Complexity can be m 


e difficult, by and large, than simple problems, 
Tick-tack-toe in three dimensions will aga 
easured in terms of the amount of material presented, 
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TABLE 4.1. NUMBER OF MOVES AND TIME OF SOLUTION 
IN DISK-TRANSFER PROBLEMS 


Number Minimal Number Mean Number Mean Time 
of Disks of Moves of Moves in Seconds 
2 3 3.5 7.0 

3 7 24.3 93:5 

4 15 182.7 557.0 


From Cook (1937). 


An experiment by Cook (1937) with the disk-transfer puzzle illustrates 
some of the relations. This task requires taking disks off one peg and putting 
them on another, using a third peg for storage, without putting a large disk 
on top of a small one. The complexity here can be stated in terms of the 
number of disks or the minimal number of moves, and by either measure 
difficulty increases at an increasing rate (see Table 4.1). As the number of 
disks increases, the necessity for planning moves increases, with a sharply 
increased possibility of mistakes and unnecessary moves. Similar results have 
been obtained with other problems that require planning of the moves before 
their outcome can be observed (Hayes, 1965). ; 

Word problems can be constructed so that each word is cia r 
the solution, in which case complexity can be measured in Fin g r e 
number of words (Johnson, Lincoln, and Hall, 1961). The example below 


illustrates a problem with four specifications: 


flat 1. map 
readable 2. book 
descriptive 3. label 
gummed 4. paper 

5. globe 


RE s ard the left and be prepared to find 
The subject is asked to study oe ae ae Then he turns to the list 


: ae T ie pee The time spent looking at each list is 
n the right and selects his a x i i 
recorded ca Fig. 4.1 shows vie Pepin ae Apo ¥ ne 
time spent studying the specifications, calle mE i iota at Raa e 3 
selecting one of the five solution words, as well as treq y SENG 
RON s a linear function, as the number of specifica- 
aration time increases, almost a ncreasing function as more and more 


sakes ight expect an i ea 
ns increases. One might exp ized, but it is likely that some of the 


an 
Words must be remembered and organ aH r 
additional specifications are redundant. The addition of words may actually 


add cues J] as restrictions and moderate the increase in memory load. 
as we ET i 
Thus the increased complexity does not increase the difficulty as much as 


one might expect. 
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15 


Preparation time 


Pe ee 
Solution time 


Median time in seconds 


3 4 5 6 7 8 9 10 1 
Number of specifications 


FIGURE 4.1. Preparation time and so 


lution time for problems with varying numbers 
of specifications for the solutions. Fron 


m Johnson, Lincoln, and Hall (1961). 


To study the memory load more directly the experiment was repeated with 
unrelated digits rather than words. When 3 to 1] digits were presented an 
the same way, to be matched with alternative displays of digits, the time in 
creased at an increasing rate. Log mean time was approximately linear with 
number of digits; since no organization was necessary and no cues were 


available, this logarithmic relation probably represents the memory load in- 
volved in grasping different numbers of 


” 

e could be rearranged as “Ie charme tout.” In 
psychological experiments the 

i i 

» guilt. From years of careful research, we om 

other problem, yet only recently 


sen in- 
š , the length of the anagram, been 1 
vestigated. 


One might expect that a small increase in the number of letters would 
produce a large increase in difficulty, Three letters can be arranged in only 
6 different orders, but 4 letters can be arranged in 24, 5 letters in 120, and 
ase in number of possible letter wane 
S of letters to be held in memory an 
cil and paper are usually not permitted.) 
ermine average solution time for long 
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anagrams because some people would not finish within any reasonable time 
limit. Kaplan and Carvellas (1968) therefore took the approach of holding 
time constant and determining the number of anagrams of 3 to 10 letters 
solved in 100 seconds. Almost all the short anagrams, such as gub and luge, 
were solved within this limit, but only a fifth of the long onés, such as 
ylhmcaelon. The increase in number of letters did not produce the large 
increase in difficulty that one might expect from the increase in number of 
possible orders and the memory load, however. That is, an increase from 
4 to 5 letters resulted in a drop in number solved within the time limit; but 
an increase from 8 to 9 letters resulted in a smaller, not a larger, drop. The 
and difficulty was more like that of the word 


relation between complexity 
and graphed in Fig. 4.1 than that of the disk- 
S 


problems described above and 


transfer problems shown in Table 4.1. f 
asons for this outcome. Longer words are more likely 


There are several rea i 
to include a repetition of letters, which cuts down the number of possible 
letter orders. But people do not solve long anagrams by E eo 
at random, according to the laws of chance, anyway. They e me li s 
hypotheses, usually with familiar combinations of three or four etters, an 
add others. Thus many rare letter combinations are eliminated from con- 


sideration. Also many of the longer words have familiar r S a a 
as ness, ory, and tion, that can be identified as a E coe P as 
units, and there are fewer long words than short words to think about as 


Possible solutions. 

If the elimination of dis 
easier, the addition of unnecessary 
dificult. Most laboratory problem 
all the items must be involved in t 
code puzzles, the addition of super 


tracting material makes some complex problems 
> material can make certain problems more 
s are constructed, like anagrams, so that 
he solution. In the case of complex verbal 
fluous data, whether apparently relevant 
or irrelevant, slowed down the solution paar 1268.: Puntas haan 
amount of data was not so distracting as the probability of starting ona 


i P: i un 
wrong approach, which was increased by an increase be os FN nee an 
in view When the letter cues were eae . : I ear ee “i o 
k amai onstan 1 e in com- 
Starting out on a wrong approach remained ¢ i 
plexity did not increase solution pets such experiments is that, while com- 
Sie peneralizanon ma. ema Get than simple problems, the increase 
plex problems are usually more d é es 
in di lty i of voderated by nonrandom ie seis the pees 
Hculty is often n s O g the people so ving the prob- 
Materials and by nonrandom proclivities lee ; eer seh pi fo 
lems. An increase in the amount of oe = 5i ET A z 
as no a false start is ase addin 
solution, but the probability of making @ ) g 


Superfluous data. 


pert 


FAMILIARITY . : 

Familiarity with the materials in 3 e ean ey S 
solutio A f s rote learning and othe i ; a 
kas LEE familiar terms are somewhat easier for college 
in 


that syllogisms written 
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students than syllogisms of the same logical form written in m 
such as chigs and gorils, or in terms of letters. Brownell and orenk C i i 
investigated this effect on the solution of arithmetic problems by pare. : 
problems in four different versions. Form A was written in terms fami Br 

children—for example, children making posters. Form B used less Familiaz 
terms—for example, soldiers grooming cavalry horses. Form C used ern 
even less familiar. And Form D used artificial terms, for example, shula is 
destroyed by planti. When problems made up in these four forms, otherwise 
equated, were given to fifth-grade children, Ta 
tions were : A 64, B 58, C 57, D 51. These differences are not large, consider 
ing that the differences in familiarity are extreme. They indicate the same 
small effect of familiarity as that found by Wilkins. Such small effects are a 


result of familiarity with the terms of the problems, not with the problem- 
solving operations. 


the percentages of correct solu- 


In the solution of anagrams the 


pertains to the solution word rather than to the anagram presented as 

problem. We do not know, of course, which words are familiar to cach 
individual subject; but words that appear frequently in ordinary reading 
matter must be more familiar on the average to a large group of subjects. 


Fortunately, millions of words of English text have been counted for other 
purposes (Thorndike and Lorge, 1944), and fron 


Tresselt (1958) selected words that are 
beach; infrequent, cobra; and vi 


effect of familiarity is large, but this effect 
a 


1 these counts Mayzner and 
very frequent, such as chair; frequent, 
: 5 a alte at 
ery infrequent, ghoul. They suspected tha 


; 

; : i for 
letter order would be an important variable also, since the word beach, fo 
example, can be made into an easy an 


and a hard one hecha. Wh 
and gave them to college 
the solution word, as well 
Furthermore, 


agram by rearranging the letters eachb, 
en they rearranged many such words as aaam 
students to solve, the effects of the familiarity © 
as letter order, were very large (see Table 4.2). 
if the anagrams have three solutions and the subjects must eei 

S written first have the highest word frequency an¢ 

e the lowest (Johnson and Van Mondfrans, 1965). 

This Strong effect 


ai s, 
of word frequency, confirmed by other experiment 
leads to Intriguing theoretical speculations given later in this chapter. 


TABLE 4.2, EFFECTS OF LETTER ORDER AND FREQUENCY 
OF SOLUTION WORD on MEAN ANAGRAM 
SOLUTION TIME Gn seconds) 
Easy Order Hard Order 
Very frequent 9.7 51.0 
Frequent 7.3 55.7 
Infrequent 20.5 116.5 
Very infrequent 46.8 135.0 


From Mayzner and Tresselt (1958). 
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ABSTRACTNESS 

The question of abstractness in the achievement of certain concepts was 
discussed briefly in Chapter 2. The best examples of the effect of abstractness 
on the difficulty of problem solving come from mathematics. The meaning 
of the number 3 is more abstract than three objects in view, and an algebraic 
symbol, X, that refers to any number has more abstract significance than a 
specific number. It is generally true that problems involving highly abstract 
materials are harder than those that involve only concrete materials. 

The abstractness, obviously, is a property of the referents of the symbols, 
not the symbols themselves. The letter X is no harder to perceive than the 
digit 3. Furthermore, as abstraction increases, we tend to use rules or algo- 
rithms for manipulating the symbols, ignoring temporarily their abstract 
Significance. As an example, we learn an algorithm for finding the product 
of X + Y and X — Y that involves manipulating these symbols in such a 
way as to obtain a result that holds for the abstractions to which these terms 
refer, The use of the algorithm reduces, in practice, the abstractness of the 


problem. 

We should note that 
more abstract words tend to be more 
when a problem is difficult, we often assert th 
sidering the other task variables. 


abstractness and unfamiliarity often overlap: the 
unfamiliar. We should note also that 
at it is abstract without con- 


EMBEDDING 

Embedding is a structural property of the ater i ` : 
easily illustrated by visual patterns. The principle states that a pa ern is 
hard to perceive when its outlines coincide with the outlines of no her, more 
inclusive pattern. Two illustrations of this are given in Fig. 4.2; Fig. 4.3 
Contrasts complexity and embedding. 


The effect of embedding on difficulty turns up 
In the following line, the problem is to find an 


FIGURE 4.2. Embedding. In 
each row the figure on the left 
is concealed in both figures on 
the right. After Gottschaldt 


(1926). 


material presented. It is thus most 


also in the perception of 


typographical patterns. 
animal word: 


a robe a robe a robe 
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FIGURE 4.3. Complexity 
without embedding. 
From Johnson (1955). 


It is somewhat easier to find it in this line: 
wofdulgibeark 
and very easy to find it in this: 


xyz bear stuvw. 


The embedding is destroyed, of course, if the critical pattern is somehow 
made more salient. The effect of salience in the illustration below is analogous 
to the effect of salience in concept attainment. 


a roBE A Robe a robe 


When this Principle is applied to abstract patterns, it is often confonndeg 
with familiarity and abstractness. Hence specifying the abstract pattern and 


: ee wee ical 
the larger pattern in which it is embedded becomes a critical methodologic 
issue. The processes by w 


à Ani? o 
hich such abstract structures are reorganized t 
facilitate cognition of the s 


olution will be discussed in detail later. 


SPECIFIC DIFFICULTIES 
Much can be gained by 
influence the difficulty 
solving turns up difficult 


problem. There are di 


identifying the general factors or conditions that 
of problems. But nearly all research on pon 
ies specific to a class of problems or even to a single 
ficulties that are specific to anagram problems, tO 
syllogism problems, to construction problems. The influence of anagram letter 


order on solution time, for example, is peculiar to the anagram problem. 
Anyone with experience in teaching others how to solve arithmetic problems, 
automobile repair problems, or personnel problems can identify specific 
sources of difficulty not covered by a few generalizations. This does not mean 


that the generalizations are not helpful; merely that they do not account for 
all the variations in difficulty, ý 


Problem-Solving Processes 
as Inferences from Behavior 


The evidence assembled above on general task variables shows that we z 
learn something about problem solving by observing the input (the materia?s 
of the problem) and the output (the solutions) without speculating very 


PROBLEM-SOLVING PROCESSES AS INFERENCES FROM BEHAVIOR ** 145 


much about the intervening activities. The yield from this input-output 
analysis soon approaches a limit, however; hence many psychologists infer 
problem-solving processes that mediate between input and output. The hypo- 
thetical processes vary from association of ideas to formal reasoning and com- 
plex strategies for processing information. But they are all similar in that 
they are attempts to answer dynamic questions: What is the subject doing? 
How does he solve the problem? 

Problem-solving processes are dynamic constructs assumed to take place in 
time, in contrast to some other hypothetical constructs—such as cognitive 
structures, frames of reference, and abilities—that are static or latent. The 
aims of this type of theoretical endeavor are to describe such processes, to time 
them, to discover the conditions under which they occur, to determine the 
contribution of each to the solution of problems, and to record individual 
differences in their operation. Before reviewing the outcomes, we should 
scrutinize the methodology of this kind of theorizing. 

A half century ago analysis of one’s own thinking was a fashionable enter- 
prise, based on the assumption that the thought processes would be revealed 
to introspection by an astute psychologist. Also fashionable was the think- 
ing aloud” method, which required the subject to report his thoughts while 
solving a problem and allowed the experimenter to classify the eg into 
intuitively reasonable categories labeled as thought pra SH 
(1910) description of “how we think” was widely mo ; i ppoe . ms 
to analyze reflective thinking for schoolteachers, be ore ae ow 
thought may be educated. It was a subjective analysis, base , he said, upon 
"description: q ly simple, but genuine, cases of reflec- 
descriptions of a number of extremely P g 


tive experience,” which were turned in by students. 

reveals, more Or less clearly, five logically 

distinct steps: (i) a felt difficulty; Gi) its location and mia ip iy ond 

Bi oT sin 3 development Nag ia ee 

of the suggestion; Cv) further observation is ee a alien [p. ga. 

anceier rejection; that is, the conclusion of belie! ; 

ysis of this kind, by Graham W allas (1926), began 

ably of information, eis ae aaa 

i ively he thinker has 

phase, called incubation, was described sg tae ior ERT to hatch, and 

lo i th the problem, he lays it away 5 and, 
aded himself up wit P ice anything, something must be going 

ya he does not consciously a ae sony totes ose, Ths 

on because later, at some Ca 


isi leads 
: happy thoughts or promising new ; 
third phase, the occurrence of these happ) 


3 ; ri ion 

he named illumination. The final testing of the leads w e n ieee 
ed ilu x uccumbed in time to 

Such subjecti alyses of thought processes $ a 

_ Such subjective analy methodology treats such research 

increasing emphasis on O 


Sane ent 
piectivity. Curr i: : 
Pe ene the consistency and utility of a coding system 
an attempt to emor 
for interpreting subjectiv 


data (as in content analysis of protocols from 

e E 

Projecti interviews) and seeks objective procedures as well. But 
‘Ojective tests or 1N 

the ancient fascination with 


“how the mind works” still persists in the modern, 
ae -solving activities. 
More critical efforts at the analysis of problem-solving 
ritica 3 


Upon examination, each instance 


Another influential anal 
with the preliminary assemb 
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The identification of one activity implies a differentation from other ac- 
tivities. The units of activity of interest in this theoretical endeavor—whether 
they are called processes, phases, operations, techniques, procedures, strategies, 
or methods—are smaller than the complete solution of a problem but larger 
than the single responses. Thus an inductive procedure is feasible if some 
regularities in the subject's activities can be identified and differentiated from 
others. If the subject begins by adjusting the light or fetching a ruler before 
manipulating the problem materials, we may describe the first activities as 
preparatory and, if desirable, time the preparatory process. In some problems, 
uncovering objects and looking for a malfunction has been differentiated from 
repair of the malfunction. When the problem is prese 
ing of the problem can easily be separated from writing solutions. If the prob- 
lem is a serial one, the sequence of moves may disclose regularities. 

As to the temporal relations between intellectu 
possibilities can readily be imagined: we can imagine two activities as com- 
pletely different and completely separated in time so that, if we knew where 


to draw the line, we could divide them cle: 


arly, Alternatively, we can assume 
that the problem-solving episode is undifferentiated and that wherever a divi- 


sion is made, the activities on one side would be indistinguishable from those 
on the other. But everything we know 
the most likely possibility is ne 
trated visually as 


nted on paper, the read- 


al processes, two extreme 


about living organisms suggests that 
ither complete separation of the processes, illus- 

=== — » Nor no separation Ç == =), 
but partial overlapping (C DR Reading aloud, for example, in- 
cludes a perceptual process and a vocalization process, and the first no doubt 
precedes the second, but the two overlap to the extent of the eye-voice span. 
Regressive eye movements, when the material is difficult, indicate an even 
greater overlap at times, Problem-solving Processes, like physiological proc- 


esses, are functionally interdependent in that the conclusion of one initiates 
the next; but the first does not always cease when the second commences. 


In describing a factor analysis of the abilities required for solving problems, 
Merrifield, Guilford, Christensen, and Frick (1962) outlined five phases: 
Preparation, analysis, production, verification, and reapplication. These are 


not viewed as clear- ather, “there is an approximation 

: 2 ’ 
to temporal ordering with much overlapping of particular events [p- 2]. 
Preparation and analys; 


gnitive phases; the motivation leads 

d information, examination of goal 
requirements, and th search model, or alternative search 
models, for the cont ion. The production of solutions involves 
both convergent an abilities, according to this account. Verifica- 
d evaluation and judgment, results in accep- 


tion, which 


mplex problems; evaluation, especially, may 
urpose of this description was primarily to 
sts for the abilities involved in each process 


of factor analysis with general psychological 
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The inductive procedure is best suited to activities that are at least partially 
observable. The deductive procedure begins with a hypothesis about covert, 
or mental, processes and attempts to arrange experiments to check it. The 
hypothesis may come from the investigator's introspection, from a mechanical 
analogy, from logic, from a mathematical model, from a computer model, or 
from principles of learning and perception. It may follow from a rational 
model of optimal procedure, such as a strategy of concept attainment. 

Such a hypothesis leads to various facilitative and inhibitory arrangements 
of the conditions and training procedures, with predictions of frequency of 
success, time of solution, frequency of errors of different types, and other 
dependent measures. Occasionally, an adequate test of a hypothesis requires 
constructing a novel problem; for example, if we wish to isolate an 
information-seeking process for special study, we can devise “troubleshooting 
problems,” in which repair is easy and the problem consists mostly of finding 
the trouble. For some purposes, questions about the problem have been 
printed on one side of a card and answers on the other side, so that as the 
subject turns over cards, his information-seeking activities can be recorded 
CRimoldi, 1955; Rimoldi, Devane, and Haley, 1961). Several methods have 
been described (Johnson, 1960) for separating the preparation process from 
the remainder of a problem-solving episode and timing it. One of these 
exposes the problem materials serially by putting the illumination of different 
Portions under the subject’s control (some of the results of the serial exposure 


technique were mentioned earlier). Problems have also been constructed in 


the form of lights and switches so that all the subject’s overt activities can 


be recorded in sequence (John, 1957). NT: si 
The overall assumption behind these attempts to identify processes is that 


they will lead to more adequate descriptions of problem solving and more 
Precise predictions than can be obtained from correlating observable ante- 
cedents with observable solution responses or from theories that assume only 
One process. A generalization that holds for one process may not te for an- 
other, and procedures that improve one process may nor ii prOYe another. All 
of these methods for inferring processes, and the logic behind them, will 
be illustrated in the experiments to be reviewed. aa i 
There is no standard catalog of problem-solving Pe s ina is a 
large amount of experimental evidence. estes e ae ee ah 
marized around the following processes. cane sin sete me ime rei 
called upon to explain problem solving as we i> e A ut nei 
the individual shifts from routine activity to pro SR VWB; ns activities 
come under the control of problem-solving ore am ete z set 
have considerably advanced the understanding Ob POTEP AG B rom laily 
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n often necessitates representing and transform- 
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& AEB ane and enlightening. We must also consider the judg- 
Structures is abundant and eng È 


ment process in problem solving. A more recent endeavor which promises 
rich a ca Sota not yet been fully integrated with the others is the 
wards but has J J 
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vincing explanation for the solution of anagrams because frequency tables 
supply independent data on frequency and generate the predictions that 
Mayzner and Tresselt confirmed. Note that nothing is assumed about any 
separate process controlling the responses; the sequence of responses is simply 
a consequence of variations in their strength. 

The same theory about previously learned associations can be applied to 
pairs of letters, known as digrams. Some digrams occur frequently in 
English, so that anyone who reads has had considerable exposure to them; 
others, however, are rare. Digrams were first counted by cryptanalysts as an 
aid to deciphering coded messages, but recently psychologists have made more 
extensive counts as an aid to research on verbal performance. To take some 
examples, the digram hi is very common, while ih is uncommon; another pair 
of high frequency is ca, in contrast to the infrequent ‘re. Thus a five-letter 
word may consist of four digrams of high frequency while another may con- 
sist of four digrams of low frequency. Except for very easy anagrams that 
are reorganized wholisticly as soon as the experimenter can stare and stop 
his watch, anagrams are usually solved by manipulating combinations of 
two, or perhaps three, letters. Mayzner and Tresselt (1962) assumed, accord- 
response hierarchy, that digrams of high frequency 
will be elicited first and therefore that words consisting of high-frequency 
digrams will be solved more rapidly than comparable words consisting of 
low-frequency digrams. An appropriate experiment found the expected dif- 
ference in solution times, though this effect was much smaller than the 
effect of word frequency. 

The difficulty of problem 


ing to the principle of the 


s, according to the principle of the response 


hierarchy, is due to interference from the incorrect responses. Considering a 
Problem as a stimulus, if incorrect responses have high associative strength 
to the problem, they are produced first and so delay the production of the 
Correct response. It follows that anagrams without such interferences will be 
relatively easy, and Ronning (1965) applied this principle to generate a 
“tuleout” factor that predicts the difficulty of anagrams. There are many 
combinations of letters that do not appear 4 the beginnings of English words. 
Furthermore, vowels are not as common as consonants at ie eae of a 
Word. Any well-read problem solver is likely to T an suc ’ sone It 
is true that the 5 letters of the anagram higlt can be arranged in itfer- 

hl, tg, and thl can be ruled out and 


ent orders, but such combinations as hg, 
do not delay the evocation of the correct sg n fact 90 sil the 120 
Permutations of these 5 letters can be ruled out. Thus this particular anagram 


Would be easy, while one with few letter combinations that aa be alae ee 

would be harder. An experiment demonstrated that this “ruleout” factor did 

3 e harder. y $ a 

influence the solution of both high-frequency and B fepeny o Ob- 

viously, the ruleout factor overlaps other factors, such as digram trequency, 
ut when some of these complications were controlled, the ruleout effect 


Was still sub: ial (Gribben 1970). 
stantial (Gri , 
here is a more direct way: to increase the response strength of a word 


and make it readily available for problem solving. The frequency counts 
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generation of problem-solving processes front computer simulation. A reviev 
of the research under these seven headings follows. 


Associative Mechanisms 


The stimulus-response behaviorists avoid describing behavior in terms of 
logical operations as they avoid identifying psychological processes from in- 
trospective evidence. Facts that others have interpreted as evidence of the 
control of responses by some hypothetical higher process have been inter- 
preted by behaviorists in terms of associations and chains of associations. 
Complex associative mechanisms have the same 
theories of problem solving as problem-solving 
tended to account for the relative difficulty 
problem-solving activities over time, ty 
ciative mechanisms are supposed to o 
derived from simpler situations. 


dynamic role in association 
processes, in that they are in- 

of problems, sequences of 
pes of errors, and so on; but the asso- 
perate according to laws of association 


RESPONSE HIERARCHIES 


N., . g 5 e 
The associative interpretation of problem solving often calls upon th 


principle of a response hierarchy, a class of responses elicited by one stimulus. 
This principle was originally developed to account for maze learning by rats, 
ut its most convincing applications today are in the field of verbal pee 
ance, as in the perception of briefly exposed words and the solution , 
anagrams. Examining the effect of frequency on anagram solution raises 4 
question that at first seems unanswerable: How can the time of solution of a 
problem be influenced by a characteristic of a solution word that is not know? 
until the moment the problem is solved? The explanation offered by May2- 
ner and Tresselt (1958) invoked a hierarchy of implicit verbal responses. I 
the anagram is considered a stimulus, any one of the many words in the sub- 
ject’s response repertoire, that is, his vocabulary, could be an implicit i 
But words, considered as responses, do not ‘ 
gth. Some words have been more thoroughly 
situations, and thus are more readily elicited as 
i in many situations. The relevant eee 
at are potential solutions, hypothetically ranked 
. To avoid confusion, it should be noted that 
the strength of the association between a response word and a specific 
ered here because the stimulus is only an 
anagram. 


. Since response strength depends on frequency of occurrence of the words 
in the past, high-frequency words will be elicited first, then words of lower 
frequency. Thus, if the Correct solution is a high-frequency word, the prob- 
lem will be solved quickly. If the solution is a low-frequency word, high- 
frequency words will be evoked first, the correct low-frequency word will 
be evoked later, and solution time will be longer. This is an especially con- 
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vincing explanation for the solution of anagrams because frequency tables 
supply independent data on frequency and generate the predictions that 
Mayzner and Tresselt confirmed. Note that nothing is assumed about any 
Separate process controlling the responses; the sequence of responses is simply 
a consequence of variations in their strength. 

The same theory about previously learned associations can be applied to 
pairs of letters, known as digrams. Some digrams occur frequently in 
English, so that anyone who reads has had considerable exposure to them; 
others, however, are rare. Digrams were first counted by cryptanalysts as an 
aid to deciphering coded messages, but recently psychologists have made more 
extensive counts as an aid to research on verbal performance. To take some 
examples, the digram hi is very common, while ih is uncommon; another pair 
of high frequency is ca, in contrast to the infrequent ‘re. Thus a five-letter 
word may consist of four digrams of high frequency while another may con- 


Sist of four digrams of low frequency. Except for very easy anagrams that 
as soon as the experimenter can start and stop 


solved by manipulating combinations of 
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relatively casy, and Ronning (1965) applied this principle to generate a 
“tuleout” factor that predicts the difficulty of anagrams. There are many 
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give an index of exposure to different words at unknown occasions in the 
past, but the experimenter can increase exposure by presenting the solution 
words just before presenting the anagrams. Wiggins (1956) did this by 
choosing anagrams with two alternative solutions, one of which was a more 
frequent solution than the other. As an example, under ordinary conditions 
90 percent of the solutions to irgn were ring and only 10 percent were grin. 
But for subjects who studied a list of the infrequent solution words just 
before working the anagrams, the frequencies of the alternative solutions 
were reversed. This is an illustration of how a little experience can promote a 
word of low rank, at least temporarily, to the top of the hierarchy. 

More subtle ways of altering the availability of words for use in problem 
solving are suggested by hypotheses about word association. Consider the 
anagram irach. Suppose, instead of the solution word, a common associate of 
the solution word, table, is presented just before problem solving. The hy- 
pothesis is that such exposure of a common associate will raise the availability 
of the solution word, though not as much as exposure of the solution word 
itself. Dominowski and Ekstrand (1967) found that the mean solution time 
for a series of anagrams after priming with the actual solution words was 
1.95 seconds, and after priming with words frequently associated with the 
solution words, 7.25 seconds. These means are to be compared with a mean 
of 31.00 seconds for a control group lacking any priming, For other sub- 
jects, who expected that the words of the preceding list were relevant when 
they were not, the comparable mean was 54.35 seconds. fi 

Since recent learning of words has such a strong influence on anagram di j 
culty, even the order in which the words are learned might have an effect. I 
the series of words is learned in a fixed order, perhaps they will be recalled m 
that order; then the anagrams that are solved by the first words of the series 
will be solved quickly and those solved by the last words slowly. Davis and 
Manske (1968) tried out this hypothesis with ten-letter anagrams but did not 
find the expected effect, What they found instead was the well-known serial 
position effect. When a series of words is memorized in a fixed order, the 
first and last sections of the series are learned quickly, and the middle section 
is the last to be learned. When the series of words was followed by anagram 
solution, the words in the middle of the series were the hardest to learn, aS 
usual, and the anagrams constructed from these words the hardest to solve- 

We have long known that previous learning contributes to problem 
solving in a general way, but in these experiments the facilitation, and also 
the interference, has been specified with some precision. 


INTEGRATION OF BEHAVIOR SEGMENTS 


The solution to an anagram problem is a word from the subject's response 
repertoire; yet the solution to many other problems is a response, or pattern 
of responses, that the subject has not previously performed. Years ago the 
classic demonstrations by Gestalt psychologists, that apes could reach a 
banana in a new way by putting two sticks together and that many animals 
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readily take a detour to a goal when necessary, challenged associative inter- 
pretations based on previous learning. Clark Hull (1935) therefore under- 
took to explain such novel accomplishments by postulating mediating asso- 
ciative mechanisms that have the role of integrating separately learned 
behavior segments. Although Hull's integrative mechanisms were derived from 
nonhuman behavior, they were intended to apply generally and can be illus- 
' ior of children solving problems. 

Kendler and Kendler (1956) was to 
hidden at G. If a child learns to 


trated most conveniently by the behav 

The problem in one experiment by 
attain a goal, specifically an attractive toy 
pull string B to get G and to pull a different string A to get a subgoal at B, 
will he combine these components on a later occasion and pull A to get G? 
The associative interpretation states that when B is performed to get G, B is 
reinforced and in turn acquires some secondary reinforcing value. Thus 
when A is performed to get B, this segment is secondarily reinforced. On a 
later occasion, the stimulus A elicits implicit responses that have been in- 
directly associated with the goal G. In other words, the B component acts 
as a mediator linking A with G. . . 

Any child will pull a string, of course, so Kendler and Kendler introduced 
another string X, tied to a subgoal Y, which was comparable to B, but had 
not been associated with the major goal, the attractive toy G. The test, then, 
was the frequency of choices of string A over string X. But this was not an 
experiment in learning, by rewarded practice, to pull one string rather than 
another. The question was whether the child, having learned two separate 
habits, would put them together and pull the correct string on the first test 


trial, 
as carried out with 128 children, three and four 


ed in solving the problem, as compared with 
thout experience with the B-G component. As 
f eight and ten, were able to make the 
er than young children (Kendler and 
Kendler, 1962). The weakness in this theory is the absence of a predicted 
order effect. Since the primary reinforcement was at G, = ed predicts 
that the optimal order would be to leam BG nee - ; ai in two 
experiments (Kendler and Kendler, 1956; Kendler 1961), no such order effect 
aPpeared. _ 
ved in terms of these associative mechanisms 
Problem solving can be analyzed in te parate habits in experiments sug- 
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(1966) attempted te discover whether adults, given difficult problems in the 


form of incomplete maps resembling mazes on paper, would work forward 

toward the goal or backward from the goal, he found that they would go 

either way depending on which direction had been effective on preceding 
2? fel 


problems. 


When this experiment w 
years old, 72 percent succeed 
47 percent of control groups Wi 
One might expect, older children, © 
Necessary integration of habits bet 
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Problem-Solving Sets 


Motives can be likened to general policies, varying in strength and 
pertinence, while the set is analogous to the executive manager, guiding sa 
organism moment by moment. The set for a simple performance, as a q 
reaction time or a word association experiment, can be considered a readiness 
to make a specified response to a specified stimulus. Thus the set generated 
by reading the first half of a sentence controls the interpretation of the last 
half. A set to solve a problem is more complicated, to be sure, but it still tan 
be considered a readiness for some specified activity. One side of the set is 
stimulus preparation, or attention; the other is response preparation, and 
either side may be emphasized for experimental purposes. The response may 
be a complex pattern prepared and performed as a unit, and the stimulus 
an abstract attribute picked out of a variegated display; but the essence of 
the concept of set is that the readiness to make the response to the stimulus 
anticipates the appearance of that stimulus. 

The set itself may be considered an activity 


or process, but it is an 
activity of short duration, preparatory to some other activity. The set alone 
does not solve any problems. Hence two explanations are required: how the 
set is established, and how it controls subsequent activity. Most of the carly 
introspective analyses of the thought processes included a preparatory 
process, often under the name of determining tendency, and most modern 
behavior theories include some short-term mechanism or chain of mechanisms 
to control the final responses, though the names for these constructs vary. 
Since the term set has been wide 
solving and seems to be 
as any. 


. . A roblem 
lv used in relevant experiments on proble i 
a ie aanne as od 
generally understood, at present this seems as go 
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A central adjustment is prepared so that, of all the 


tivities the n" i srs. inhibited 
activities the organism is capable of, some are facilitated and others inhibit 
or delayed. Four conditions 


of problem-solving sets. 
In the first place, of all the 


supply much control of activity. | 
the subject simply 


i fi ‘ 7 ivation 
are particularly important for the activat 


motives for solving problems most do not 
n problem-solving experiments, for example, 
agrees to cooperate with the experimenter. The socia 
motives are the Operative ones, but their effects are quite general. 

Secondly, within a favorable motivational background sets are often 


activated by instructions. The control of one human being’s activity by in- 


d 
structions from another human being is a remarkable effect that no theory 
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of psychology can overlook. Instructions do more than communicate facts 
and request people to start and stop; they prepare them for perception and 
for action. These instructions may be rather general: “Your task is to 
construct words from these anagrams.” Or they may be more specific: 
ach anagram must be rearranged to form the name of a common animal.” 
A large part of the achievements of experimental psychology is built on 
the ability of human beings to prepare themselves in advance, when requested, 
so that they can respond optimally when the anticipated event occurs. 
The effects of interpersonal relations on problem solving are not limited 
to explicit instructions. The subject of an experiment may pick up cues 
from the experimenter’s voice, or his gaze, or a hand movement that leads 
or for one type of 


racy 


him to be prepared for speed rather than accu ” 
solution rather than another. Many findings of differences in results from 
different experimenters are probably due to the sets aroused by these subtle 
variables or by the subject’s own perception of the demands of the situation 
COrne, 1962). Thus taboo words are not offered as anagram solutions under 
ordinary social conditions, although they do emerge under more permissive 
conditions. 

Since sets function at an intermediate level, between motives and actions, 
the effects of motives may override the effects of instructions. Ask someone 
I - to see food in ambiguous pictures, and 
Many curious results have been 


adopt sets that 


who is nauseated to act hungry or 


he may try and fail, or he may not ty. 


obtained in the laboratory because people are loath to 


Conflict with their ulterior motives. 
Thirdly, sets are influenced, like 
When a person faces a problem that looks l 
Set to approach it as he approached similar problems in the past. , 
Fourthly, it is the shortterm effects of practice—in the form of a train- 
ing series of problems immediately preceding a test series—that have been 
most frequently studied in experiments on set. A brief series of problems can 
narrow the set so that, when another problem is presented, the solution will 


other activities, by previous learning. 
familiar, he usually reinstates a 


be quick and invariant. in B 
Thus the set is influenced by several antecedent conditions. But once 
i i individual’s efforts so that cer- 


d channels the 
and limitations can be observed. When the 


instructions and practice, he 


Sstablished, it integrates an 
tain momentary potentialities 


individual } od, by appropriate 
al has been prepared, ©? Seals : ; 
can attend to sounds, words, or numbers optimally, but his attention span 


is stil] not unlimited. Seven items are commonly cited as the number the 
Average adult can retain when the task is merely to listen and recall; but 
When the items are to be retained and related so as to solve a problem, the 


imit is probably lower. 

The set is limited on 
to Perceive the shapes 
It is limited on the resp 
will take him somewhat longer. 
tative or inhibitory. If the experimente. 
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r tells the subject to look for shapes 
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and the task requires finding shapes, the set instigated by the instructions 
will be helpful. If, however, the task requires finding colors, the shape 
instructions will be detrimental. Likewise, a series of arithmetic problems 
easily solved by addition engenders a set to add, which will be helpful if 
the meet problem can be solved by addition but not if it must be solved by 
subtraction. It follows that successful problem solving often requires shift- 
ing the set. 

A few representative experiments will serve to illustrate how problem- 
solving sets are activated and how thev influence solutions. 


ILLUSTRATIVE EXPERIMENTS 


The classical experiments were designed by Rees and Israel (1935) to 
demonstrate the importance of the set in problem solving by the use of 
ambiguous anagrams with two solutions. They demonstrated that an appro- 
priate set could be established by direct instructions. Subjects told to con- 
struct words of a certain category, such as nature words, solved more 
anagrams this way than control subjects. They also demonstrated that a set 
can be built up by a series of problems with solutions in the same category: 
Subjects who solved a block of anagrams by nature words solved more of 
the next block by nature words than control subjects. Rees and Israel were 
able to show the effect of a letter-order set as well. To take one example. 
lecam can be changed to camel by starting with the middle letter, and 
subjects who solved a block of anagrams by changing letters in this order 
readily solved the next block by the same procedure. Later experiments 
have verified these results, for both the category set and the letter-order set 
(Johnson, 1966). 

Luchins (1942) went a step further to illustrate the detrimental effects 
of a set. He had his subjects do a series of arithmetic problems, with three 
imaginary jars of water, that could be easily solved by two subtraction opera 
tions, then gave them two ambiguous problems that could be solved by the 
same procedure or by only one subtraction. Use of the longer procedure was 
considered evidence that the set established on the first series of problems bed 
persisted, and most of Luchins’s subjects did in fact use the longer procedure 
although control groups starting w 
so at all. Luchins also included a 
longer procedure but could be easi 
of the subjects failed to solve 
though there were few fail 
Luchins also increased the 


ith these ambiguous problems did not do 
problem that could not be solved by the 
ly solved by the short one. The majority 
this critical problem in the time allowed, 
ures in control groups that had this problem first. 
amount of practice with the first procedure ane 


FIGURE 4.4. Arrangement of letters 
for study of eye movements Bare 
2 solution of anagrams. The letters 7 
k c be rearranged in order 34521, aia 
3 b the numbering system shown p 
left, to form the solution word. Fro 
5 = i Kaplan and Schoenfeld (1966). 


PROBLEM-SOLVING SETS ** 155 


nsi 1 1 T m 
15 
| RD 
10 j N 
g3 Le 
n 
= Le = 
ohn f 1 
2 h 
215 IV 
£ 
w 10 
g PS 
= 
= 5 
2 
5: oad 
8 may 
15 
FIGURE 4.5. Solution time on 
successive anagrams for three sub- 10 
jects; ns indicates no solution. The 
first 20 anagrams were solvable by 5 DO 
Me: iile 34521, the next 10 by pe | 
ule 31245, and the last 10 by the 
rule 52143. From Kaplan and 0 S E a 


Schoenfeld (1966). 


ad increased the number of failures 
have confirmed these findings. 
ationed below have allowed subjects to ma- 
remove Covers, and so forth, in order 
that problem-solving sets and othe s can be inferred from observable 
information-secking movements. The serial exposure procedure for study- 
ng concept attainment (described on page 56) was designed to time per 
formance on a sequence of problems as the set developed and also the shift 
an inhibitory effect on the 


to a new set. The experiment demonstrated i i n 
Cognitive side: once the set to attend to the shape dimension was established, 


the texture dimension was ignored. Another outcome was pod “a and 
ti i x i o i ir sequence: the frst rormu ation 

ming of four problem-solving operations in q et ee aA 
of the problem; the unsuccessful search for a solution : a y h 
tion; switchback and reformulation; and the unsuccessful search for a solu 


tion to lation 
match the second formulation. l 
4 An ingenious experiment by Kaplan and pionki ee ‘em 
i Ta ements during problem solving. 
The approach by photographing a, a e because the 
he letter-order set for solving anagrams is co ie 
etters can be separated and fixation of the eyes on each letter recorded. 
Pe Fig. 4.4. One can begin with the middle 
letter, move to the lower left corner, and so on, to make brick. The first 20 


anagrams were soluble by one order, the next 10 by another order, and the 
Next 10 by another. The lowest graph in Fig. 4.5 shows that solution time 
for ] subject droppe d rapidly as he did the first few anagrams and rose 


found that this strengthened the set ar 


on the critical problems. Later experiments 
Other experiments to be mer 


nipulate light switches, turn cards, 
or processe: 


he letters were spaced out as in 
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temporarily when the order was changed for problems 21 and 31. Tihe a 
graph in Fig 4.6 shows that the drop in variability of eye pea ea 
this subject paralleled the drop in solution time as the letter-order sik 
veloped. The other 2 subjects did not adopt the set so readily, Consequently, 
their solution times were longer 
the same parallelism between eye 
seen in the records. 
These experiments w 
established and hav. 
formation-seeking n 


j pnt but 

and their eve movements variable, T 

ry P . n r be 
movements and solution times can 


ith various problems demonstrate that sets arc ie 
e predictable consequences on solution time, errors, : i 
hovements, and eve movements. In this respect, pionin 
solving is similar to many other activities of living organisms. Even a a 
requires a preparatory inspiration. Thus the concept of set in prob o 
solving illustrates the continuity between intellectual activities and simple 


PPE- Š i 5 Pa işs not 
activities for which a preparatory adjustment can be identifed. It is 


aa 7 i 5 ‘ Jood- 
Surprising that set is a favorite concept of American functionalism CW 
worth, 1937), 


THEORETICAL ANALYSES 


Fig. 4.7 has been Prepared to clarify 
at theoretical interpretation. The 
Within a certain laboratory 
menter to subject establish a 
out of anagrams. Then, as a 
lems, the subject establishes 
range the letters in a certain 


some research designs and attempts 
column at the left illustrates one deigr. 
atmosphere, general instructions from a 
set to solve problems, for instance, make ans 
result of brief practice with easy Alpha ae 
a set to use the Alpha method, that is, to pa 
order, so time and variability decrease. Since 
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we are assuming that the subject has one set at a time, we do not speak 
of this set as different from the general set with which he began, but we 
say that this general set has been narrowed and made more specific. There 
is no incompatibility between preparing to solve anagram problems and 
preparing to solve anagram problems by the Alpha method. 

It would not be correct to say that the subject learns new responses; he 
already knows how to add and subtract, to rearrange letters, and to spell 
words. What is carried over from the earlier problems in a series to the later 
ones is the adjustment; trial and error has been eliminated and the sub- 
ject looks directly for the critical aspects of the materials presented. When 
he perceives them, he quickly makes the manipulations he is prepared to 
make, The observable reduction in solution time and invariability of prob- 
lem-solving attempts, in comparison with a control group of the same back- 
ground and the same general set who have not had the training series, is 
the evidence for assuming that a set has developed. The test series might 
ambiguous problems, soluble by either the Alpha or 
Beta method, and another group would be given a training series of Beta 
problems. Then the dependent variable would be frequency of use of Alpha 
and Beta methods on the ambiguous test problems. ; 

The column at the right of Fig. 4.7 illustrates another design. Instead 
| instructions and narrowing it by specific 
eginning by specific instruc- 


also be a series of 


of establishing a general set by 


training, a specific set is established at the b 


Motives, social situation, and 
other background conditions 


Motives, social situation , ond 
other background conditions 


Specific instructions prepare 
specific set to solve problems 
by alpha method 


General instructions prepare 
general set to solve problems 


Training series of alpha problems 
Prepare specific set to solve 
Problems by alpha method 


Test series: Alpha problems 
or ambiguous problems 


Test series: Alpha problems or 
ambiguous problems 


Beta problems disrupt set and 
Prepare specific set to solve 
Problems by beta method 


Effects of specific 
Ef fects of series and instructions 
shifts of series 


FIGURE 4.7. T uences for research on problem-solving sets. 
.7. Two seq s 
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tions. The dependent variable may be solution time or frequency of solu- 
tion of ambiguous problems by the alternative methods. As far as the 
observable evidence indicates, it is the same set whether developed by gen- 
eral instructions and a training series or by specific instructions. 

In order to study shifts of set, many experiments have included a differ- 
ent series of problems for which the first set is not appropriate. When a sub- 
ject set for solving Alpha problems by the Alpha method encounters Beta 
problems, he fails, or he shifts to the Beta method. Typically this set, while 
not incompatible with the general set to solve problems, is incompatible 
with the previously practiced specific set. Hence development of the new 
set implies cancellation of the old, and questions of the interfering effects of 


previous training arise. The new set can be instigated also, of course, by new 
instructions. 


EFFECTS OF THE TRAINING SERIES 

Although the training series usually requires only 15 minutes or so, the 
acquisition of a problem-solving set is an obvious opportunity for testing 
various theories of learning, Maltzman and Morrisett (1952) looked at the 
development of a set for solving anagrams from the point of view of S-R 
learning theory and tried to test some quantitative predictions. The growth 
of habit strength as a function of practice is negatively accelerated; prac- 
tice produces improvement in the lower part of the learning curve faster 
than in the top part. This leads to the prediction that a difference in habit 
strength will develop more rapidly between two weak habits, otherwise 
equal, than between two strong habits. In their first experiment, a strong 
habit strength group received 15 eating-set anagrams and 15 nature-sct 
anagrams in random order before solving 10 test anagrams. A weak habit 
strength group received irrelevant anagrams, then 2 eating-set and 2 nature- 
set anagrams before the 10 test anagrams. The test anagrams were all 
eating-set anagrams for half the subjects in each group and all nature-set 
anagrams for the other half; and the critical measure was the frequency of 
appropriate set solutions. The prediction from the theory was that the weak 
habit strength group would learn the new set faster, and the confirmation 
of this prediction supported the assumption that the set can be treated as 
a complex habit acquired in the laboratory situation. 

The effects of the experimenter’s instructions cannot be separated com- 
pletely from the effects of a training series because as soon as the subject 
begins to follow instructions, training effects begin to cumulate. Training 
rig cannot be completely separated from the effects of instructions 
either, because some general instructions always precede the training series 
as part of the subject’s introduction to the experiment. But Maltzman and 
Morrisett (1953) were able to study the effects of specific instructions 
added to the effects of a training series. Instructions to “look for a particular 
order” given just before the training series did increase the frequency of 
letter-set solutions above that produced by the training series alone. Similar 
instructions given just before the test series also increased the frequency 
of set solutions (Maltzman, Eisman, Brooks, and Smith, 1956). Apparently, 
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instructions have an effect on the set that cannot be replaced by the growth 
of habit strength during a training series; hence the experimenters had to 
conclude that the effects of instructions on the set are not completely ex- 
plained by associative learning. 

Safren (1962) applied a word association theory, coming from research 
on free recall, to the development of a set to solve anagrams. She began by 
making up lists of 6 words of maximal interitem associative strength, such 
as milk, cream, sugar, coffee, sweet, and drink. The hypothesis was that a 
Series of anagrams so constructed will be easy, because when 1 solution 
word is attained, its associations facilitate recall of the solution word to 
the next anagram, and then the associations of these 2 words further facili- 
tate recall of the solution to the next, and so on. Six such lists were used, 
and the same 36 anagrams were also used in 6 unorganized lists. As ex- 
pected, there was a gradual reduction in median solution time for the 
group who did the organized anagrams but not for the group who did the 


same anagrams without the interitem facilitation. Overall, the median solu- 


tion time for the unorganized lists was 12.2 seconds, and for the organized 


ists, 7.4 seconds. 


Another group in the Safren experiment received special instructions 
ò 


giving the labels of organized lists, such as beverages, and for these the 
median solution time was 2.8 seconds. As noted above, Maltzman and 
Morrisett found a parallel effect of instructions in addition to the effect 
large effect of the labels suggests that each one 
l a class of solution words. This 
describing a specific mechanism 
ams, but the specificity limits 


of a training series. The 
a cluster or 


e virtue of 
r solving anagr 


facilitates immediate recall of 
word association theory has th 
for the development of a set fo 
it to the category set. , 
The experiments on the development of problem-solving sets are parallel 
in some respects to experiments on the acquisition of concepts and principles. 
Successive problems present apparent differences and abstract similarities, 
and as the subject proceeds, he perceives the similarities easily and executes 
This parallel recalls the controversy between 
conceptions of concept learning discussed in a 
sa methodological criticism. The experiments 
given above have reported group data only. The possibility of exons 
in the individual acquisition curves, which is the most interesting alternative 
to incremental associative learning, is obscured by averaging. Stellwagen 
t of a category set for anagrams 


C1968) therefore plotted the developmen sÍ 7 
by following the procedures described in Chapter 2. She gave a series of 


17 anagrams, constructed from 5-letter food words, to 40 subjects in different 
Orders. When the solution times were averaged in the usual way, the 
usual gradual decrease was evident, suggesting cumulative learning. But 
when the solution times were combined before and after each subject’s last 


long solution time, a discontinuity was apparent, as im Fig. 4.8, suggesting 

that the different subjects “caught on” at different points in the series, after 
d : . - 

Which they solved all problems quickly. The usual large variation in solu- 


the common response quickly. 
Continuous and discontinuous 
Previous chapter, and suggest 
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FIGURE 4.8. Solution times for 


‘ o 
a series of anagrams arranged to show evidence f 
discontinuity 


r s ; iscontinuity less 
tion times before the decrease makes the demonstration of discontinuity 
than completely convincing, 


ative e 
but the results do suggest an alternative to th 
Customary interpretation in t 


erms of cumulative associative learning. 

It is not necessarily the best start to treat the development of a set as the 
acquisition of something; something is lost also, Rather, the development 
of a problem-solving set should b 


e regarded as a progressive sharpening 0 
the general set communic. 


ated by the instructions. Instructions to look for 
a letter order or to look for a class of words sharpen the set somewhat, while 
instructions giving the actual letter order or the category of solution Rio 
sharpen it more specifically. This view is not incompatible with the view o 


a set as a complex habit if we consider it a temporary habit, easily turned 
on and off. 


The nature of the 
of a problem-solving 
that initiates problem 
is likely to continue t 
in these experiments 
cept problems—have 


motivation for solving problems and the reinforcement 
set must then be examined. Whatever the motivation 
solving, when the subject succeeds by one method he 
o use that method. The problems that have been used 
—anagrams, little arithmetic problems, and figure cor 
been chosen by the experimenters because the subjects 
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can recognize a solution as soon as they attain one and stop work. It is 
or toys or pieces of candy; the reinforcement is 


not necessary to give points 
he has or has not succeeded. The fact 


cognitive in that the subject knows 
that there might be an alternative solution, even a shorter one, does not 


influence the subject unless he knows it. 


SHIFTING SET Z each ‘ 
Since failure to solve a problem is often attributed to rigidity, shifting 


from one set to another has attracted considerable experimental attention. 
Schroder and Rotter (1952) approached this rigidity question from a social 
learning theory that emphasizes the subject's expectations. “Under what 
conditions does an individual enter a problem situation with an expectancy 
that a single behavior will lead to reinforcement and under what conditions 
will he expect that there may be more than one way of reaching the goal 
[p. 141]?” To study this they devised training series which, according to 
their hypothesis would build up different expectancies about alternative 
paths to the goal. Each problem consisted of six designs to be grouped into 
a training series were arranged so that some subjects 
y one attribute (form) and others various kinds 
| ping by one of two attributes alternatively 
f the test series, which required shifting 
another, showed clearly that those training 
and built up expectancies for alternative 
ater test problems. Similar results 
(1958) with the Luchins arith- 
of simple associative learning 


three pairs, and the 
had practice in erouping b 
of practice that encouraged grou 
(form or color). Performance © 
from one method of grouping to 
Sequences that forced shifting 
Toutes to the goal facilitated shifting 0n l 
Were obtained by Gardner and Runquist 
metic problems, but thev preferred the language oF $ 
so they analyzed the situation in terms of competing responses rather than 
alternative expectancies. 
Another variable that coul 
effort invested in the practiced met 


d influence readiness to shift is the amount of 
hod. Knight (1963) gave his subjects only 
one arithmetic problem for training, but one group had i — easy 
to add and subtract while the other group had to go through the same 

-place numbers. Those who worked 


arithmetic operati ith awkward three 
perations W ith aw J l 
harder on te first problem were more reluctant to shift to other feasible 


proveduresiand: when forced to do $0, shifted as little as possible. This effort 
variable may be an alternative interpretation ftr the elects oë tesa ae 


amount of practice. 
We have seen that theoret 
theoretical fashions of the time; 


ical speculations about sets have followed the 
hence we may next anticipate a cognitive 


type of theorizing. The reinforcement for ~<a = the set during a 
training series could be the subject's knowledge that he has attained the right 
Solution. The problems in the experiments above i solutions that are 
immediately recognized as such by the subjects; no external confirmation bý 
the experimenter is needed. When the subject encounters 3 different type 
Of problem in a test series, he faces a cognitive conflict in that the problem 
does not ft his expectations. If he applies the practiced method, he will 
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recognize the outcome as a failure. It could be this mismatch between what 
he perceives and what he is set for that forces him to change his set. ar 
an interpretation offers a modern alternative to the interpretation in terms 0 
competing responses, and would predict discontinuity 


in solution times for 
a series of simple problems. 


THEORETICAL DIFFERENTIATIONS 

If the notion of a problem-solving set is to h 
must be differentiated from some other, related notions, The set is not the 
response, and the development of a set is not the same as the acquisition of 
the component response patterns, the writing, spelling, adding, and sole 
ing. These have been learned at some time in the past, not in five minutes 0} 
practice with seven problems. What is dev 
one or another of the alternative patterns when appropriate. 

The set is not the same as attention. The individual does not look at the 
most salient or the most intense stimuli in the fie 
first, but the concept of set implies reduction : 
The property of attention that is especially relevant to problem solving 1$ 
selection. One cannot attend to everything, and, once a set is established, one 
does not attend to as much as he can. In the experiments on figure concept 
problems, for example, after the subjects were se 
in terms of shape, they did not attend to textur 
back to perceive texture similarities, 

The essence of the set is a 
experiment, when perceptio 
easy, it is obvious that the 


ave any precise meaning, 1t 


eloped is the readiness to execute 


Id of view; he may do so A 
of this trial-and-error search. 


a ae 

t to conceptualize the Bgu 
. ri A 

e and had, in fact, to switc 


prepared adjustment. In the simple reaction time 
n of the signal and execution of the response are 
critical feature of the set is the neuromuscular 
readiness. In the word association experiment, it is obvious that the set some- 
how prepares memory or recall so that when the stimulus word is heard, aN 
appropriate response word occurs quickly. In problem-solving experiments, 
where the problem situation is complex and several alternative response por 
terns are possible, establishment of the set means that the critical features 0 
the situation have been identified (as a result of instructions and perhaps of a 
training series) and a response pattern integrated, so that the subject is pre- 
pared to respond quickly, without attending to other possibilities, when an- 
other problem is Presented. Putting the hyphens in situation-and-goal set, a$ 
Woodworth did (1937), emphasizes the fact that the coordination between 
the response and the critical features of the situation is prepared in advance- 

The difficulty in theorizing about set in conventional terms is that it ca? 
be established by instructions as well as by training. There are certain ad- 
vantages in trying to fit problem-solving activities into the theoretical frame 
works that have been useful in research on animal learning and on human 
learning of simple tasks, just as there are advantages in putting problem 
solving into the Cognitive framework. But there are disadvantages also, and 
these show up prominently when we come to the effects of interpersonal 
communication on problem solving. Telling a college student how to avoid 
errors On a pursuit rotor task or how to memorize nonsense syllables has only 


we 
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limited value, but telling him the relevant attributes in a concept problem 
or the class of words that are solutions to anagrams reduces the solution time 
drastically. One sentence of instructions might be equated with 20 minutes 
of practice. The capacity of the literate, socialized person to prepare a com- 
plex adjustment of his perceptual and motor systems for anticipated problems 
merely by listening to brief instructions is a distinctly human capacity. Any 
theory that deals with instructions under the heading of standard conditions 
or as a minor methodological nuisance cannot go far. 

One criticism of the concept of set is that it is vague. We have seen that 
sets can easily be established with predictable consequences; the vagueness 
does not lie in the empirical facts. What is felt to be vagueness may be the 
difficulty of fitting these facts into theories that are inadequate. There are 
many mechanical and electrical analogies, the most familiar being the gear 
shift on an automobile. Moving the shift lever does not move the automobile. 
It prepares an adjustment of the gears so that the car will go forward, or 
backward, when the power is applied. The gear-shift mechanism is not the 
same as the other parts of the drive train and automobiles have been built 
without gear shifts. What the changeable gears contribute is just what the 
set contributes: flexibility of control. 

The problem-solving get will be properly interpreted only when the cogni- 
tive side is taken as seriously as the response side and when the capacity of 


the literate individual for immediate temporary adjustment as a result of 


social communication is fully appreciated. 


Seeking Information 
problems a process of information seeking can 
Those concept experiments that allow the 
subject to choose some cards rather than others or to switch lights from one 
display to another and the anagram experiment that photographed ORONS 
ments all reveal something of the subjects search Re useful Ls tn Soe 
Ae > i -pressly to differentiate this process 
Other experiments have been designed expre : Skins ell i a 
from others and to study it separately. In o m og 7 > z eu me) 
; 2 z i evant information is 
nostic or troubleshooting problems, finding the re be vičwed as an cl S 
main diffculty. The information-seeking Process CES 7 : at 
tion of the cognitive side of the set- The initial eps necessarily v gen- 
d e set 1S narrov 
eral, but as information is uncovere oe wea tos 


Search for more specific possibilities. 


In the solution of many 
be identified without difficulty. 


d, presumably 


DIAGNOSTIC PROBLEMS 
The troubleshooter can usually 
Ponent in the system that has bro 
and work, so questions about the or 
are the determinants of informatio 
shooters use the optimal sequence? 


fnd the trouble if he checks every com- 
ken down, but this may take some time 
der of checking components aside: 
n-seeking sequences? And do trouble- 
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To answer these questions, Detambel and Stolurow (1957) built a device 
with three components, each of which could be labeled “OK” or “Out of 
order.” But these components were covered with lids that could be lifted 
for inspection only by removing some screws by hand: seven screws for Com- 
ponent A, four for Component B, and one for Component C. If we consider 
the number of screws, indicating the work to be done or the cost of the 
information, as one independent variable, the other was the probability of 
malfunction. Component A was defective in half the problems, B in a third, 
and C in a sixth. The optimal sequence depends on both probability and 
work. When we have all the data—as we have in Table 4.3 for one prob- 
lem—we can calculate that the optimal sequence in the long run would be 
C, B, A. , 

College students were paid to find the troubles in 200 such pieces of equip- 
ment and were told that they could quit when finished. The amount of work 
required to lift each lid was apparent from the number of screws in view, 
but the probability of finding the malfunction in each component came only 
from cumulative experience with the problems. More than half the subjects 
began by minimizing work, that is, by unscrewing the lid on Component C. 
Then, as they learned the probabilities, they gradually shifted toward lifting 
A first to maximize the chances of a quick discovery. Many shifted back to a 
work-minimizing strategy later, but the authors concluded that the subjects 
generally emphasized probability more than work. It is true that the sub- 
jects were influenced by both variables in secking information, but only l 
of the 12 adopted the optimal sequence and stayed with it. 

Another experiment of this type (Goldbeck, Bernstein, Hillix, and Marx, 
1957) employed two systems of electrical components, one simple and one 
complex, to determine if college students could learn to use an efficient 
troubleshooting method called the half 
ponents all equally liable to malfunction, the most efficient procedure is to 
dichotomize or split the system in half and check one half; whichever half 
is defective is split again, and so on. The results of a training session showed 
that the half-split method was quickly mastered in simple cases when each 
unit in the system was easily understood. When the system was complex, 


split. When there is a chain of com- 


TABLE 4.3. DESCRIPTION OF A TROUBLESHOOTING TASK 
OF THREE COMPONENTS 
Probability Number of 
Component of Trouble Screws 
A 500 7 
B 333 4 
C .167 ] 


From Detambel and Stolurow (1957). 
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most subjects did not catch on to the optimal procedure with one hour of 
instruction, but some did with two hours. Those who got high scores on a 
reasoning test caught on quicker than the average. Ability to apply the 
checks at the optimal places required some skill in deductive reasoning when 
the relationships within the system were complicated. 

Dale (1959) allowed his subjects more freedom in looking for the faults in 
sir strategies. Each subject saw a display of 19 
electric sockets and was told to find the one that was defective. Plugging in 
a tester would show a red light if the socket was defective, otherwise a green 
light. Most of the subjects searched by some kind of pattern, for example, 
stepwise. A third searched randomly—as far as the experimenter and other 
judges could determine. 

Then the subjects were told that the electric current was flowing by 19 
stages and that the malfunction could interrupt the current at any one of 
these. The tester, if lighted, would show that all was functioning properly 
up to that point. The experimenter called this an ordered system in contrast 
to the others in which the items were independent. On problems involving 
subjects used the optimal half-split strategy 
or less systematic. They did improve a little with practice 
i s who got high scores on an intelligence 
ion-seeking strategies on both ordered and 


a system in order to observe thei 


an ordered system, only a few 
but many were more 
on three problems. Those subject 
test used more systematic informat 5 
unordered problems and therefore were generally more efficient. l 
A special problem board with shutters that could be moved aside to dis- 
close the information underneath was devised by Neimark (1967), who 
arranged information in different patterns so that certain moves apne pe 
a safe information-gathering strategy and others a gambling en, “A safe 
Strategy for this problem, like the half-split strategy for Chee yee ems, 
eliminates half the alternatives with each move and thus obtains complete 
information with the smallest average number of moves. The gambling 
Strategy is a move that might, with good luck, uncover the needed ee 
on the first try, and the experimenter manipulated the nes o a ems 
on which gambling would be successful by manipulating the information 


revealed. Analysis of the first moves on successive problems indicated that 
sera profit by their experience in this situation and 


college st able t 
udents rere able o 
e S to guide their moves. Those who were never 
a veloped a safe strategy and stayed with it. 
tly paid off leaned toward 


ling moves frequen r 
he results came closer to a theoretical model 


ld make rational decisions than to other 


) 96 children (Neimark and Lewis, 1967), 
ere random at the nine-year level, but the 
those who were intellectually 
ith mental age. This rather 
learning rate, but to an 


to formulate strategies, Or rule 
reinforced for gambling soon de 
Those who found that their gamb 
the gambling strategy. In general, : 
that assumed college students cou 
models considered. 

When this task was given to 
their information-gathering moves W 
optimal strategy was frequently employed by 
More mature, as shown by a correlation of .52 with 
high correlation was attributed, not to an increase 1n 
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increase in the number of children, at mental age of twelve or above, who 
could profit from their mistakes in an all-or-none fashion, and work out the 
more logical strategy. 

In high school and college also there are substantial correlations between 
the use of an optimal strategy for seeking information and such standard 
measures as intelligence and average grade (Rimoldi, Fogliatto, Erdmann, and 


Donnelly, 1964). 


GUESSING GAMES 


Information-secking strategies show up also in simple card games, as when 
one person tries to guess what card another person is thinking of. D. J. Davis 
(1967) manipulated the structure of the display of cards to see how this 
would influence the questions asked. He placed 16 cards before the subject. 
each labeled with a letter or a number, one of which was the correct card. 
The subject's task was to determine the correct card by asking questions, to 
which the experimenter answered only Yes or No. “All cards had equal 
probability of being the correct one, hence the optimal information-seeking 
procedure in the long run would be successive half splits, as in the problems 
above. 

When half the cards had letters and half had numbers, the perceived 
structure of the problem favored the half-split strategy and all subjects used 
this as an opening question. First, they arranged the cards in two rows, thus: 


ABCDEFGH 
12345678 


With this structure in view, 
or “Is it a number?’ 
information-seeking 


2 A eTa i p 
the obvious first question would be “Is it a letter? 
The perceived structure thus encouraged the optimal 
; strategy. For other subjects, more letters and fewer num- 
ers were presented; hence questions based on the letter-number dichotomy 
ven no longer be optimal, and such questions were actually less often asked. 
n the extreme case, with 15 letters and 1 number displayed, this question 
eae os at all; more awkward but more efficient questions were used. 
e té . Š ia wy f r > 
M amiliar game of twenty questions is psychologically intriguing because 
e questions put the player's thoughts on record. In this game, one player 
asks questions to which the i? p B a c 
i ques “i o which the answers are only Yes and No, in order to find out 
what it is that th i rare j 3 > êm 
e other is thinking of. Many systems or strategies are €M 


ploye: “peri 
ploye d and experienced players are more efficient. In one experiment (Faust, 
) covering five days, it appe 


much of asking better questions 
little information. 


Children enj i ? i E 
infona = Da games like Twenty Questions and sharpen their 
information-seeking ski > ` A ae pi 

; h § skills while playing them. We expect them to ask better 
Seal a mature, but we also expect the changes in their questions tO 
fit in ee what we know of cognitive development in gencral. Mosher and 
Hornsby (1966) looked for age changes by the simple procedure of showing 


ared that the improvement consisted not s 
as of eliminating poor questions that vielded 
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cach child pictures of 42 familiar objects and requesting him to find out “which 
one it is that I have in mind.” Yes and No were the only answers to the 
children’s questions. The optimal strategy, the half split, is hardly applicable 
to this collection of miscellaneous objects. The next best thing would be to 
that groups a large number of specific pos- 
sibilities into 2 domains, in one of which the correct answer must lie. “Is it a 
toy?” “Does it cut things?” This strategy was called constraint seeking. At 
rategy, if it deserves the name, of hypothesis scan- 
ning. “Is it the hammer?” The most distinctive feature of the results was that 
the six-year-old children were limited to hypothesis scanning, while con- 
$ istic of eleven-year-old and, to some extent, 
ith general questions and 


begin with a general question 
the opposite pole was the st 


straint seeking was character 
cight-year-old children. The older children began w 


used the information thus obtained to formulate narrower ones. 
a verbally described incident, 


A more difficult form of this game involves 

: s P ‘ 

such as: “A boy goes home from school in the middle of the morning.” In this 
goes 


case, the children had to generate alternative explanations for themselves 
before asking questions. The cleven-year-olds established contraints and asked 
series of connected questions much more than the six-year-olds, but on these 
unbounded verbal problems the eight-year-olds were not much advanced 
bevond the six-year-olds. The authors, following Bruner, state that: 


ymbolic representation rather than on rep- 


resentation by action and image—and thus is able to construct groupings of ob- 
jects and events on the basis of characteristics they have in common—he attains 
the structure of information necessary to follow tirough a SA acre 
ing s 3 aa RE ; questioning that narrow from the 
ing strategy, to guide his inquiry by cycles of q S 


general to the specific [p- 101]. 


as the child increasingly relics on s 


SET IN READING 
d to certain possibilities rather than 
tuation or to recall them from 


THE INFORMATION-SEEKING 


As the information sought is narrowe 


others, a set to perceive these in the present si i ! 
the past is establish d. Th n between information seeking and 
ast is established. 


t can be illustr 


e close connectior ¢ rmat 
ated by reading, which is, after all, the 


formation for solving problems is gathered. Reading 
a highly practiced skill, but it is swayed, like 
> onitive demands of the moment. A set 
r than another can be established, like 
mation in previous problem solving 


a problem-solving se 
method by which most in 
by older children and adults is 
other intellectual activities, by the cogn 
to read for one kind of information rathe 


other sets, by s ful use of the infor 
, by success ul use i : . à i 
or, as Torrance and Harmon (1961) did, by instructions. They had graduate 


Students read assignments in preparation for an mene and gave different 

grou r sa $ One group was told to read with a memory set: 

Sroups different instructions. g tina T 

Remember everything.” Another group was a ad creatively: 
A. D 


“As you gana thiolsok ways that you can use this information. Think of many 
IC as e `. «Try to find defects in the research. Ts 


uses.” A, aluation: 
k st ie evaluatio ) 
nother stressed eva s» When they were tested on memory 


what th ; i eally trues 
he author is saying Teal’) . payani ? 
creative, and evaluative tasks, the differences came out as expected. Those who 


had studied with a creative set did best 0n the creative problems; those who 
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had studied with an evaluative set did best on the evaluative problems. Dif- 
ferences on the memory tasks were small—probably because all graduate 
students are habitually set to remember everything. 


SOME GENERALIZATIONS 
Despite the diversity of materials and proce 
seeking, when the results are lined up, a substantial amount of agreement 
appears, justifying some general conclusions: (| 
procedures, all requiring overt move 


dures in research on information 


) A variety of very simple 
ments or questions, have been quite suc- 
cessful in producing objective evidence on 

(2) Typically, the subject is set for acquisition 
but he can also get set to make 


information-secking activities. 
and retention of information, 
creative or evaluative use of the information 
obtained by reading. (3) Most subjects improve with practice, if the informa- 
tion is not too complex, by taking advantage of the structure of the informa- 
tion and eliminating inefficient moves (4) Intelligent adults tend to approach 
logically ideal strategies, but they seldom reach the ideal. (5) The complex 
problems of this type require a series of decisions, which mav be logically 
arrived at, hence they resemble re 

success in reasoning tests. (6) 
the information needed, they shift from guessing about concrete facts to more 
organized strategies that move from broad cate 
subcategories, 


‘asoning problems and success correlates with 
As children develop the ability to conceptualize 


gorics of information to narrow 


Representing and Transforming the Data 


Let us consider a simple everyd 


. ee sople 
ay problem. We are expecting certain peo] 
to meet in a certain room, 


so the problem arises of supplying a chair for each 
person. One procedure would be 
people. Let each person pick a ch 
over. More likely, someone 
determine arithmetically 


a direct physical matching of chairs and 
air and see if any chairs or people are E 
will count the people and count the chairs anc 
if the two counts match. In gencral terms, we can 
say that the essential elements of the problem are represented syrmbolicaliy, 
and the problem is solved symbolically; then the symbolic solution is appliec 
to the real problem. The first step can be likened to encoding, the last to 
decoding, 

This representative process does not always occur. If the problem is a con 
struction one, such as a jigsaw puzzle, it is usually solved by manipulating 
the pieces directly, Anagrams are solved by rearranging the given letters: 
But the process of representing the data of the problem in some other form 1 
a common phase of problem solving that merits systematic consideration: 

One simple type of representation is the labeling or naming of singular 
concepts. When the hostess solves the seating problem for a dinner by arrang- 
ing name cards around the table, the function of each card is to represent the 
Single person to whom it refers. On maps, specific cities, rivers, and other 
geographic features are represented by specific names. But a symbol may also 


ye 
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represent a class of objects, as when the hostess uses square cards for male 
guests and round cards for female guests. The admiral solves some naval 
problem by letting blue markers stand for ships of one class and red markers 
for another, then moving the markers in accordance with his knowledge 
ass, Mathematical symbols, of course, representing 
abstractions, have the greatest utility in 


relations, operations, and higher 
problem solving. We are so accustomed to the symbolic representation of 
inded occasionally that this is a 


objects and classes that we need to be remi 
highly cultivated accomplishment. According to the preceding chapter, each 
of us reaches this stage of development only after years of representing reality 


of the capabilities of each cl 


through images and actions. 
Although a problem is, by definition, a novel situation, each adult has 


a repertoire of symbols that are potentially applicable. One of the first steps 
toward solving many problems consists of representing the given facts in 
Conventional terms, then structuring the relations between them and the 
specifications for the desired solution in sentences: a verbal conceptualization 
of the problem. Some problems can be formulated in mathematical terms: 
Let X represent the number of trials. Let Y represent the proportion of errors. 
What is required is... The representation of geographic conditions, human 
and other abstract complexities in verbal or 
entials of the problem can be put down 
identify the data that are missing 


activities, economic events, 
mathematical symbols so that the 
on paper” is a major step forward. 


and lead to more enlightened inform : kot 
When problem solvers and teachers of problem solving speak of “conceptu- 


Bie seve ae Ei mod wre ` om,” they are not speaking 
alizing,” or “formulating,” or “setting UP the problem,” they peaking 
of the words, numbers, letters, and other terms, but of a structure of terms 
i of the problem. Usually the terms are familiar 
novel structure, organized to fit each new 
osing a set of terms that is not appropriate 
a structure of terms that does not accurately 
m. They also occur in the decoding opera- 
the real world, as well as 


ess 
This may 
jation seeking. 


that represents the structure 
but the structure of terms is a 
Problem. Errors are made by chi 
to the problem and by arranging 
represent the structure of the proble: 
tion, that is, in applving the symbolic solution to 
in encoding. i 

` Because of the spatially ext 
tionship between elements Car 
rari Charts, maps, schemati 
ogical relations, electrical circuits, 


C a È . . qc 
ode the relevant relationships in VIS" 
e not amen 


tem, the rela- 


ended properties of the visual 
a often be represented graphically with special 
c drawings, diagrams of mathematical and 
musical scores, and other schemes that 
al form take advantage of these proper- 
able to verbal communication. In 
represented visually, one can “read off” the 
thinkers report that they use their own 
mplex data. These may be images, 
not easily communicated, but all 
time only in the minds of original 


svn to show structures that ar 
Some cases, when the facts are 
Solution directly. In addition, some 
idiosyncratic schemes for representing co. 
vaguely formed patterns, Or heuristics 
representational systems existed at one 


thinkers, 
? as the Mercator projection and th 
The general adoption of a system—such Pre) ig 
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n Bie ake it 
Linnaean classification system—depends on the thinker's ee een 
explicit and communicate it to others, Thus, w hile most ae x iR 
standard structures for representing reality, a few are trying | aaa 
their own visions of reality. The other side of the <p ase 
exemplified by the scientists who puts down on paper the imaginary 
that evolved in his own fantas naa A 
novelist, who dreams of a locale, peoples it with characters, and engag 


at others 
ie rinted page so that © 

in action, then represents his private world on the printed page 

can share it. 


Many problems can be 


DE T sad of We 
Y: Even more inventive Is the work 


solved, 
more easily solved, or perhaps can only ae 
when the information about the problem represented in one Se rie 
formed to another. Solution may be aided by translating a verbal desc er 
of the facts into a set of numerical statements, or by representing num 
data in graphic form. Smaller transformations 
coordinates to polar coordinates, from 
or from stimulus-response l 
useful transformations are 
served through the 
be decoded back to 


at le 


yueacianl 
ENEA E tear Pipes 
a decimal system to a binary © ally: 
to cognitive language. More pena ë 
those in which the structure of the data is | 
transformation so th 
the origin 
ast the level of cognitive 

Despite the importance of 
evidence is meager. One 


anguage 


~an 
at the solution in symbolic temma a 
al problem. This achievement seems Ti i l 
development that Piaget calls reversibi —_ 
this process for problem solving, experi 


» psy 
; asked how PS) 
unpublished study by Curt Wilson asked d foune 
a > x ee, ` an 
chology students represent the data of a psychological experiment oo 
that those who took the trouble to draw 


» best € 

a graph formulated the | rds, in” 
. 5 cards, 

sented as a series of statements on Jata 
the da 


clusions. In this å 
cluded an 


had been converted to graphic 


In any case there 
ferent represent: 
(1967) structure of intellect, 
preferences because we know 
graphic means whene 


ase the data, pre: 


interaction which was obvious when, 


form. 
are individual diffe 
ational Systems, 


and only when, 


rences in the ability to handle dy 
which appear, for example, in aa Jving 
These may be related to problem:so s 

that some people prefer to solve problems 


i ana 
’ > on, í 

ver possible, others prefer a numerical saran a at 
others typically substitute words for numbers. They may be relatec 


«acts 
Cess also, if an experiment by Gavurin (1967) with a small sample of ae 
is confirmed with a | €. On the assumption that the letters yurin 
anagram are visu d and rearranged in imagination, C 

ith scores on a standard test of T ha 
of .54. When the letters were on tiles 


bly 
‘ s resuma®. 
the correlation was negligible, pres 

S Unnecessary, 


could be Physically 


rearranged, 
because Spatial repre 


sentation wą 


rganizing Cognitive Structures 
Structure has many 


can 
i A j ing we 
meanings; of those involved in problem solving more 
say that a structure Consists of an Organization of elements which is 
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than the elements and which persists when the elements change. A triangle 
drawn in black ink on white paper, for example, could have the same struc- 
ture as one drawn in white on black. As we noted earlier in the discussion of 
embedding, an element is hard to pick out when its outlines coincide with 
those of the larger structure. One may restructure what he perceives by look- 
ing at it from a different point of view, by breaking it in half, by rotating it, 
by coloring one part, or by reversing figure and ground. Some relations 


within the structure, necessary for solving a problem, are apparent only 


after such restructuring. 
The concepts of structure and restructuring, or reorganizing, are favorites 


of Gestalt psychology and are usually illustrated with examples from visual 
perception. But abstract cognitive structures are more relevant here. If we 
have A and B classified under I; C and D classified under II; and I and II 
classified under 9, the structure is a trilevel binary structure that is different 
from a list of seven elements (see Fig. 4.9). An illustration of restructuring 
would be the reclassification of C, putting it under I, and a more radical 
case would be changing the whole to a bi-level structure. Figure 4.10 tep- 
resents two family structures composed of father, mother, and three children. 
These figures are representations; we cannot draw a family. We can, however, 
represent graphically the relations between the elements, that is, the structure, 
of an organization so that the difference between two structures, as in this 
figure, is obvious. , . 

The most important structures for problem solving are dynamic structures 
of behavior, some of the elements of which are moves or ieee Fe 
game, for example, includes not only such elements as payas and balls, che 
a variety of plays, all of which are replaceable although t a a 
been quite stable for many years. Card games supply other familiar crampes, 
The elements of the game may be visible objects, such as bats, balls, e 
or spades; visible moves, such as swinging a bat or playing B ea sai ta 
concepts, such as strike, trump, or win. An integrated series 0 a suc a 
a forcing play, may profitably be treated as a substructure. ome complex 
dynamic organizations of objects and functions are called systems: a ventilat- 
ing system, an accounting system, a wiring system Sw va aid Seay Bt 

The intricately structured behavior of a spider pec a it aTa 
building a nest suggests a genetic mechanism for es a tS eect aie 
integrating substructures into the larger structure, adap <i paige 
structure of the environment. The structures of human 


i z nvironment and on 
depe divi erience in adapting tO tes 
pendent on individual exp! a significance of the elements and 


indivi k unctional ‘ è 
dividual knowledge of the fun SK association might be considered 


Substructures in the overall structure. An 


FIGURE 4.9. Representation of a tri-level 
"nary taxonomic structure. 
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M Ẹ 


+ 


ii i S 
—t—\ FIGURE 4.10. Representation of two family structures 
A B C A B C consisting of father, mother, and three children. 


a simple structure, and a chain of S-Rs would be a more complex serial 
structure, all executed mechanically. But if an individual is to make flexible 
use of a cognitive structure in solving a problem, he must know the inst rumen- 
tal Function of each component; then, if a component is missing or defective, 
he can replace it. Obviously, a knowledge of the elements, the substructures, 
and the overall structure facilitates solution of problems involving the structure 
and, in the case of games, the participation of the individual as an clement 
in that structure. Thus many of the more enlightening experiments are con- 
cerned with knowledge of the components of some larger structure. 

The relation of the part to the whole is an old controversy in psychology, 
and many of these questions have been discussed in other terms. Knowledge 
of the overall structure and of the functional relations of the constituent 
elements is often called understanding in contrast to 
of the elements. It makes little sense to speak 
simple S-R association; only when behav 
one ask about understanding of the components and their function in the 
whole. Bartlett (1958), for example, treated thinking as filling in gaps in a 
system. Another old term, recently used in 
which, according to Miller, Galanter, and Pribram (1960), is “any hierarchi- 
cal process in the organism that can control the order in which a sequence of 
operations is to be performed [p. 16].” Thus plans and the integration of 


subplans within plans give behavior its structure. De Groot (1965), following 
Selz, noticed that the thinking of maste 


the formulation of a plan and that the moves were integrated with, and 
realized, that plan. i 


a fragmentary knowledge 
about knowledge of a reflex or 
ior becomes more complicated can 


a more precise meaning, is plan, 


; ; 
r chess players was characterized by 


The desired solution to a problem may be described, especially by writers 
of mystery stories, as the missing piece in the structure. In some cases, 
problem solving consists of organizing the known pieces into an incomplete 
structure or search model, then se 
for a missing piece that fits. This 


solution must fit certain specific 


arching the environment or one’s memory 
notion of a search model implies that the 


: ations, that these specifications are organized 
rather than just listed, and that accuracy and completeness in the search 


model help the solution. Unfortunately, the idea of a search model, though 
intuitively plausible, has not generated relevant empirical research. But recent 


research from the verbal learning laboratory under the name of “free recall 
has documented the influence of clusters, y 


and other organizing principles in the produc 
A persistent methodological question aris 


i ogies, 
ules, coding systems, steg 
tion of words previously learned: 

: h a 
es at this point: How can W 
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vs the structure? The spider does not know 
the structure of the web that he spins so expertly. A beginner may play a 
game quite well for some time before he understands how the elements are 
integrated into substructures and how manipulation of these affects the 
whole. Structures may be inferred by an experimenter which have no signifi- 
cance to the subject of the experiment. One obvious method is to ask the 
subject of an experiment to represent a structure somehow. He may do this 
graphically Cas in Figs. 4.9 and 4.10) or by stating the rules of the game 
verbally, by executing a series of gestures, Or perhaps by mathematical equa- 
tions. The study of the thinking of expert chess players by De Groot (1965) 
was made possible by standard schemes for representing moves and combina- 
tions of moves, schemes known to both the players and the experimenter. 
Adequate representation requires that the structure be preserved through a 
transformation of elements, as from pieces on the chessboard to symbols on 
the printed page. Just as we take a definition or a classification rule as one 
measure of knowledge of a class concept, we take a description of the rela- 
tions between members of a family or a diagram of an electrical circuit as 
Measures of knowledge of these structures. 

It is possible also to ask which of five alternative representations of the 
structure is the correct one. Is the family next door represented by the left 
side or the right side of Fig. 4.10? The limitations of these methods are 
parallel to the limitations of similar methods for measuring knowledge of a 
class concept. 

Another method is based on observation of the attempts to solve a problem 
which involves the structures of interest. The subject's arrangement of the 
materials, use of tools, search for information, sequence of arithmetic opera- 
tions, and consistent errors May permit inferences about the structure he is 
assuming. People solving arithmetic problems, construction problems, Jigsaw 
puzzles, and concept problems can often be seen arranging terms more con- 
veniently or lining up some pieces in a row for comparison with others. In 


many card games, the cards come at random and the average player's first 
7 in a manner that he assumes will be helpful. 


does not solve any problem; it merely facili- 
arrangement of the materials frequently 
hin the materials and his plan of 


determine whether the subject knov 


Operation is to arrange them 
This is a preparatory process that 
tates the next step. But the subjects mm" 
reveals his understanding of relations with 
action, 

When the game is played or 
Many observers (e.g, De Groot, 
tary moves are often integrated int li ler sol 
that is then executed as a unit. Likewise when the goa ap wes ex > u- 
ti iti ; i ji e expectation of synthesis later. 
lon, it is often analyzed into subgoals, with th p bl K aon bebayi 

More formally, the experimenter predicts certain problem-solving behavior 

i cture of the problem, then takes the 


from a h is 2 þject’s stru 
a hypothesis about the subj merpat a diss bpoiheds. The 


observed behavior as support, OF lack ol ‘aa . x 
limitations of this method also are parallel to the limitations of corresponding 


: t. Naturally the converge 
methods for assessing knowledge of a concep 7 gence 


the problem is solved by a series of moves, 


1965; Gyr, 1960) have noticed that elemen- 


o a combination of moves, a substructure, 
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of inferences from a representation of the structure and from problem-solving 
behavior would be more convincing than either one alone. f 

The question that has concerned most investigators is whether the subject 
knows the structure that is necessary for solving the problem. Success is more 
likely if the subject's structure of the elements corresponds with the objective 
structure. If he does not know the correct structure, we may also ask, as we 
do in the case of concepts, what is the idiosyncratic structure that he is as- 
suming? 

In the case of complicated structures, knowledge of the structures may be 
partially correct, rather than cither right or wrong. One person may know 
enough about an electrical circuit to turn the right switch, another may 
, and another may be able to redesign 
the system. To put this point more generally: some problems require a higher 
criterion of attainment than others for solution. 
relations are relations of inclusion. One who 
nows that whatever is true of I as a class 
re, such as Inhelder and Piaget studied, 
- The important structures for the solu- 
are made up of functional, means-end, or 
cause-effect relations: the relation between a child’s attention-secking activi- 
ties and the mother’s responses Cand perhaps the modulation of this by the 
presence of the father); the relations between the elements of a dynamic 
system of switches, meters, lights, and resistances; the relevance of a certain 
i f arithmetic operations; the fit of a tool to a 
the articulation of an administrative procedure 
with a system of administrative procedures, According to our definition of 
problem solving, if the subject knows all the parts and how they fit together, 


he has no problem. Problems arise from ignorance of the parts or of their 
Organization. 


PRINCIPLES, PROPOSITIONS, AND EXPLANATIONS 
From the psychological point of vi 


inciple i iev t 
ew, a principle is an abstract achievemen 
of special use for the soluti 


on of problems. It has the form of a proposition or 
edicate. Birds fly. Babies are cute. Sugar 


: with the plural nouns. Work = force X 
distance. 


Although the term c 
can be made, In many 
as dax, and uses it discr 
out asserting any 
Correct classificati 


oncept is often used to include principles, a distinction 
laboratory studies of concepts, one learns a word, such 
iminatively in classifying and pointing to objects with; 
more about dax than when one says, “This is a baby. 

on and correct usage are the tests of achievement. In educa- 


and principle are used interchan 
A principle is m 
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statement that birds Ay is an abstract generalization relating a class of activi- 
ties to a class of objects. Thus we can treat the principle és the structure and 
the concepts as the elements. But a principle is more than a conjunction of 
concepts. Combining the concepts square and red to form the concept red 
square is not stating a principle. The distinctive feature of the principle is 
that concepts are combined in a special relation: the subject-predicate rela- 
and predicate may include substructures, with modifiers 


tion. Both subject 
ntence structure; but the subject-predicate 


and exceptions, in a complicated se 


relation is the kingbolt of the whole. 
Principles have a close connection with explanations. Explanation may be 


a difficult notion technically, but in psychological and educational contexts 
we count a reply to a question about an event as explanatory if the applicable 
principle is stated. Thus thinking is often initiated by a question—posed by 
the thinker or someone else—and terminated when an explanatory principle 


is found. 
The close connection between principles and problem solving has been 


emphasized by Gagné (1965) in the following statement: 


l em solving involves the combining of previously 
earned principles into a new higher-order principle that “solves” the problem 
and generalizes to an entire class of stimulus situations embodying other 
problems of the same type. Problem solving occurs when the instructions pro- 
vided the learner do not include a verbally stated “solution,” but require him to 
construct such a solution “on his own.” When this happens, the individually 
constructed higher-order principle is effective in generalizing to many situations 
and is at the same time highly resistant to forgetting [pp. 165-166]. 


To summarize, discovery or probl 


es, like the achievement of other standard 
be tested by several methods, five of which 


are listed in Table 4.4 for comparison with methods of measuring achievement 
of concepts. We can observe someone's use of the term for the principle 


ìn writing or speaking and note proper usage and gross errors, or we can 
vhich call for discriminating use of the term. We can ask 
he principle or, if appropriate, for a mathematical 
we ask someone to identify objects as 


The achievement of princip! 
abstractions of the culture, can 


_ situations v 

or a verbal statement of t 

Or or. S A 3 

£ graphic representation. Or, Just as 

TABLE 4.4, companisoN OF SOME MEASURES OF ACHIEVEMENT OF CONCEPTS 
AND PRINCIPLES 


Ce Principles 


Use of term in communication 

Verbal statement on request 

Classification of situations to which 
principle does and does not apply 


Given in explanation of an event 
olution of a problem 


i ; ets ae 

— of term in communication 

os definition on request 
lassification of positive and n 
instances 


egative 


Application to s 
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positive and negative instances of a class, we can ask him to identify those 
situations to which a certain principle is applicable, or to select that one of five 
principles that is applicable to a given situation. 

The special functions of principles, however, are in the explanation of 
phenomena and in the solution of problems. So the achievement of prin- 
ciples is most often tested by a Why question about an observed phenomenon, 
or by a problem, such as a geometry problem, for which the principle is sup- 
posed to be necessary. 


LEARNING PRINCIPLES 

The learning of principles has not been investigated as thoroughly as the 
learning of concepts. It is reasonable to suppose th 
somewhat as the more abstr. 
variables emphasized by 


at principles are learned 
act concepts are learned, and that some of the 
the results of concept research—such as relevant 
and irrelevant variables, positive and negative information, and salience of 
the critical dimensions—will be important here also. 

Duncan (1964) studied the 
numbers with certain letters 
his subjects guess at the re 
They were more 


induction of a simple principle that paired odd 
and even numbers with other letters by having 
lations, and then informing them of their success: 


successtul—as measured by numbers of pairs required and 
by number of correct statemen s of the 


principle—when the experimenter 
permitted them to select whatever p 


airs they wished than when they were 
restricted to pairs chosen by the experimenter. This finding agrees with the 
comparison of the selection and reception methods of learning concepts. In 
this relatively simple situation, it is advantageous for the subject to be able 
to test his own hypotheses in his own way. In general, those subjects who 
succeeded were more systematic in their approach to the task, varying only 
one variable at a time. 


The nature of the relation between the statement of a principle and the 
behavior that illustr. 


ates it is an important que 
yet thoroughly ; 


stion, frequently asked but not 
answered. Presumably, there 


a mutual facilitation between 
the formal statement of a principle and its application to conerete situations, 
so that the learning of either one assists the learning of the other. But we 
know that verbal expressions of a principle, for example, frustration leads to 
aggression, are often memorized without regard to observable face And a 
tactful person in a social situation may reveal behavioral skills predictable 


from the frustration-ageression Principle even though he cannot state such a 
Principle formally, 


No matter how the 
are retained better th 
isolated Principles ma 
and Wiegand, 1968) 


in problem solving w 


Principles are acquired, it is generally agreed that they 
an isolated facts—although the retention of concrete 
y be Surprisingly small under some conditions CGagneé 
- The general question of teaching principles for use 
ill be taken UP systematically in the next chapter. 
DEVELOPMENTAL TRENDS 

Games, as noted above, 


provide interesting illustrations of dynamic sys 
tems of objects and actions. 


In his well-known book, The Moral Judgment 
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of the Child, Piaget (1932) described how children explained the rules of a 
game of marbles to him and how well their behavior corresponded with these 
tules. The youngest children did not talk about rules, of course, nor give any 
evidence of following them when they played with marbles. Children of age 
three or so seemed to know that the game has a structure, that some moves 
are right and some are wrong, and they tried to imitate the older children in 
assification at this stage, however, they were influenced 
more by superficial perceptual aspects than by abstract principles. They talked 
in terms of rigid eternal rules even though they did not describe them or 
follow them accurately. Children of seven and eight followed the rules quite 
well. But only the children of ten to twelve had mastered the complex rule 
structure in an abstract way so as to integrate rules and behavior systematically 
and to conceive of the rules in terms of mutual agreements instrumental to 
mutual enjoyment of the game. T 
It is in adolescence also that formal propositional thinking develops, accord- 
ing to the Geneva school. Propositions are stated and tested against observa- 
tions, and specific events are explained by reference toa ps p 
The vounger child, at the stage of concrete operations, is tied to = z 
while the older child looks for general principles. For example, one of the 
tasks used by Inhelder and Piaget (1958) required the subjects to a, to 
hit a target by bouncing a ball off a bank, as in billiards, and to pi w = 
happened. The younger children gave concrete se on ` 
hits here, then it goes there.” The adolescents gave more exp oe 
terms of a gencral a aciple, although not always well expressed, to the effect 
a general princip 


that the angle of incidence equals the angle of reflection. 
PIECES OF A SOLUTION 


ad been exposed to Gestalt ideas at the 
that attracted 


these respects. As in cl 


ORGANIZING THE 
In the 1930s Norman Maier, who h ; ; pone ae 
University of Berlin, began a series of a XE re aibeduent at 
considerable attention and have had a strong in pe . oe os 
lem-solvine research. The problems were rather difficult cons n g k 
le C la ; f component pieces and physical principles. 
ems, requiring the integration. O listinguish the integration of separate ex- 
-easoning to distinguls grasia f 
= 5 fer. Although this term no longer 


and trans 
hether ordinary principles of learning are 


ing activity is still a live one. In fact, 
-i ations of them that challenged 
ations of prob- 


Maier used the term 
Periences from ordinary learning 
Seems appropriate, the question W 
h problem-solv 


sufficient to explain suc agi ee 
rt Was these experiments and Maier’s interpre tionist interpret 
Hull and his followers to work out 7 associationis 
f -> this chapter- 
em solving sketched earlier in this chap aon 
One experiment (Maier 1930) gave college students the ei ae 
i : pares r ji ake chalk marks 
ing 5 J ] two pendulums that would swing to m 
E Sapparta tahale 620 1 i f this problem requires the 
at designated places on the floor. The solution o! ; s pi jea ere 
pra a À a E 
use of several physical principles or techniques, such as wedg p i p 
ing p i attern. And all these parts must be properly 
& as parts of the solution pa a nak H SRO SE er 
combined in the correct organization. Different group: j g 
òo A a 
C1) the problem only, (2) demonstration of the necessary parts o the 


178 *¥ THE SOLUTION OF PROBLEMS 


solution, (3) instructions to combine the demonstrated parts of the solution, 
(4) a hint about hanging a pendulum on a nail in the ceiling, and (5) all 
of these aids. Only in the groups that were given all the aids was there an 
appreciable number of solutions. None of the aids was sufficient by itself. 

Maier argued from these results that knowledge of the component parts 
was not enough; the organization of the parts was crucial. But this research 
has not gone unchallenged. Weaver and Madden (1949) found that they 
could induce relatively more solutions by a thorough explanation of the 
parts. Saugstad (1957) gave his subjects even better understanding of the 
principles, not by demonstration but by having them actually perform little 
tasks using these principles. Then almost all the subjects, under the most 
favorable conditions, were able to apply the principles to the problem. 

Another experiment by Maier (1931) was done with a two-string problem 
that has since been widely employed by others. Two strings hanging from 
the ceiling are to be tied together, though they are so far apart that the subject 
cannot grasp both at once (see Fig. 4.11). The best solution is to tie a weight 
to one string and set it swinging like a pendulum, then run over to the other 
string and bring it to the swinging string at the right moment. Only 39 per- 
cent of Maier’s subjects got this solution in 10 minutes, even though a pair of 
pliers, suitable for a pendulum weight, was in sight on a table. Pliers are 
conventionally used to hold things, so their use as a pendulum weight re- 
quires restructuring the functional significance of pliers. When the experi- 
menter brushed one string to make it swing and handed the pliers to the 
subject, 38 percent more were able to solve the problem. Apparently this aid 
directed their attention to the unusual potential of the pliers. 

A third experiment by Maier (1945) attempted to evaluate the contribu- 
tion of previous experience to the problem of constructing a hatrack from 


FIGURE 4.11. Two-string problem. 


i tri together? From 
Johnson (1961). How can he tie the two s rings tog 
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given materials. Only 6 out of 25 who tried this problem without previous 


experience solved it. Another group had practice on a similar problem that 
and 12 of this group 


included the critical components, as a test of transfer, 
of 25 succeeded. But the best results were achieved by a third group that 
had the initial practice with the other problem and also a view of their 
completed solutions while working on the hatrack problem; 18 out of this 
group of 25 succeeded. Evidently previous practice gives some help, and an 
example of a critical component in view gives more. Maier claimed that the 
transfer by itself was not sufficient. 

The argument for an organizing or structuring process is both negative and 
positive. To say that knowledge of the parts of a solution is not sufficient is 
a negative statement, difficult to confirm. Psychological research seldom ac- 
counts for all the factors that determine results; all experiments leave a large 
Portion of the variance unexplained. Therefore, a hypothesis which predicts 
that some portion of the results will be unexplained is not specific enough 
for us to test it. The positive side is the demonstration that certain condi- 


tions facilitate organization: instructions, practice, hints, and examples. 
x is effective and aids the solution to some 


med components. A second possibility 


is the subject’s unaided restructuring of the functional significance or means- 
end possibilities of an object. A third effect, which Maier has emphasized 
under the name direction and others call set, is that instructions, hints, and 
an example in view all direct the thinker’s attention to the problem-solving 


function of an object that otherwise might be overlooked. 


INITIAL ORGANIZATION OF THE MATERIALS 
Tt would not be correct to assume that solutions to problems are arenes 
from unrelated elements. Even if the materials in view Fhe to ns 
related, the thinker’s associations tO them, his perception x them, a a je 
Cognitive structures by which he conceptualizes err a SS 
initial organization of the materials at hand, and eas Hi — 
may be a help or a hindrance, but it cannot be ignored’ 4 is w “en = 

solving processes include the process of breaking up 2” reorg E 
Mitial patter lem. R e” , 
e dd n f e problem offers a businesslike SeA i fini 
heen eon 48 wag sh SO 

ought it for $70. 
ee da nome a ipa much did he make n the re — 
[p. 305]2” keane to Maier and Burke Gan = pa pte e 
many errors on this problem because the identity of the horse i- ines two 
not an overall structure 


transactions į > ded in this case is 

actions into one. What is neeee f b 
i s can be se 

Ut separation into two structures, and the two transactions ca separated 


; made, they say, when the problem 
o chlorine the Horses. Enas i eens horse for $60 and sold it for 


3S stated as follows: “A man bought a Id it for $90 3091.” 
70. # for $80 and sold it for $90 [p. 309]. 
Eiet ia a a Meral ad construction problems, we might ex- 
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Apparently, previous practice 
extent by transfer of previously lea 
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the letter order, 


inith ii an anager hat is, 3 
pect that the initial organization of an anagram, t anagrams are 


could have an j»bibitory effect on reorganization. Usal aeann 

n aines tes il ao apparent organa D S AS eh 
EZZ E iy RE One eo Gin ee eee reser? tes : 

Ja oa p a i ine pang ne and other confusing 
variables pres nea > cones. Beilin and Horn (1962) constructed a os 
of anagrams in unorganized or nonsense form—for example, erten (for se 
the solution is enter)—and another list, equated for word frequency 
other variables, in word form—for example, cause (sauce). The results of 
their experiment showed that the word anagrams were much more ened” 
the average solution time was 16.7 seconds, as compared with 9.4 secon is 
for the nonsense anagrams. They concluded that a word does resist analysis 
and reorganization more than a meaningless group of letters. The same 
difference was found again with tachistopscopic exposure of these anagrams 
at 1 second and also at a half-second (Beilin and Levine, 1966). 

This inhibiting effect of words is due to the structure of the word, hence 
one might ask if it would apply to children who are not so familiar with 
words. Beilin (1967) investigated this question by giving anagrams in wore 
form and in nonsense form to children of eight, ten, twelve, and fourteen. 
He found the usual difference at age fourteen: 34.5 seconds for five-letter 
anagrams in word form, and 18.0 seconds for equated anagrams in nonsense 
form. Children of eight had a hard time with these problems; the average 
time was almost a minute, but there was no difference between word and 


sis š R k re 
nonsense forms. Thus Beilin’s data disclosed two developmental trends. The 

was a general improv 
there w 


ee 


ement in anagram-solving skill with increasing age; an 
as a special inhibitory effect of word structure, which did not appear 
until around age ten, when the children had become quite familiar with 
the structures of these words (see Fig. 4.12). 

Although the investigation of anagrams in word form was started to test 
a hypothesis derived from Gestalt principles, other hypotheses are possible. 
Ekstrand and Dominowski (1968) confirmed the ‘inhibitory effect but 
Suggested an associationist interpretation. Associations to nonword anagrams 
are likely to be series of letters, which would facilitate solution, while associa- 
tions to words are likely to be other words, which would seldom be the correct 
solution words, Pronouncability is another contributing factor because ana- 
grams in word form are easier to pronounce than nonword anagrams. But 
nonword anagrams vary in pronouncability also, and those that are easy tO 
pronounce—and thus presumably tightly organized—prove more difficult to 
solve than the ones hard to pronounce (Dominowski, 1969; Hebert and 
Rogers, 1966). A little practice in pronunciation decreased reading time anc, 
by strengthening the organization, reduced the number of solutions (Domi- 
nowski, 1969). The apparent implications of this novel line of research are 
that vocal processes intervene in the solution of anagrams and that the vocal 
organization is as important as the visual organization. 


In nonword anagrams, the amount of reorganization required is a function 
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Best whee 


? =n bo boneeo 
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Like Waves, Kan welt 009 Gestalt concept ee and: l ject 
struction problems hA in German in Seana how an ke 
re laera e acta for problem 
solving, by being included as pare of an irrelevant siieip 1 ul a E 
may be taken as an example. The task is to support three ca he Te mian 
door at eye level, ostensibly for an experiment on vision. On a ta fe ee 
objects, including a few tacks and three small boxes of anen rie fae 
colors. The correct solution is to tack the boxes on the door as pla 
the candles. > Seiad ate 
The experimental variable in Duncker’s experiments wos ete the 
rangement of the materials. For an experimental group of seven s an For 
three boxes were filled with candles, tacks, and matches, mae mere 
the control group of seven subjects, they were empty. The airy with 
perception, or cognition, of the boxes used as containers wou ied but 
using them as platforms. All of the control subjects solved the ea with 
only three of the experimental subjects. Similar results were it niet 
several other problems of this kind, and Duncker's a fox ont 
heterogeneous functional fixedness. Cognition of an object re ades JE 
function fixes it to that function Cor means-end relation) T penne 
cognition for another function, a special case of embedding. This nition that 
tion could also be stated as the establishment of a set for one ae aha 
hinders another cognition if the emphasis is placed on the cogniti 
than the executive side of the set. as in what 
In Duncker’s experiments the differences between groups T affect @ 
they perceived; but others have introduced modifications to test t bject, and 
what they learn. The independent variable is preutilization of an o r pee 
the transfer paradigm, helpful in designing experiments on motor = ae 
learning, is helpful also in visualizing the sequence of events . cane # 
functional-fixedness experiments. Task A consists, for example, g aad #5 
box as a container, and Task B consists of a problem. If the box ah sine 
the solution to the problem more often by those who have had Tas a ta 
by control group, the outcome can be called positive transfer. “fe walle 
is used less often by those who have had Task A, the outcome can T atti 
negative transfer, Putting the functional-fixedness experiment into t Ep a 
fer paradigm is helpful methodologically: it suggests appropriate contr haie 
elaborations. It does not eliminate the difficulty of deciding whether th 
sults are due to negative transfer of a response or of a cognition. 51) had 
One carefully designed experiment (Birch and Rabinowitz, 19 witch 
subjects try the Maier two-string problem using either a relay erais oild 
as a weight for the pendulum. The test was not whether the subjects a mes 
solve the two-string problem but which of the two objects they woul 5 
to solve it with. A preliminary experiment showed that the two were u 
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ive the probiem under standard conaions. ine preuuiiza- 
lay or the switch in constructing a simple 
dulum problem later, those who had used 
the relay now used the switch, and most of those who had used the switch 
now used the relay (see Table 4.5). In general, 17 out of 19 subjects avoided 


the object that they had just used for a different purpose and made use of 
ed. Other experiments by similar methods have 


~ tion consisted of using either the re 
electrical circuit. In solving the pen 


the object not previously us 
also confirmed such functional fixedness. 

Research on other problem-solving processes, especially sets, has stressed 
the importance of instructions, and in one of Maier’s experiments the in- 
structions directed the subjects’ attention toward uses of an object that would 
otherwise be overlooked. Instructions were also found to influence solution 
of the two-string problem, by limiting or extending the range of objects to 
be considered (Battersby, Teuber, and Bender, 1953). One group of subjects, 


told to use only the five objects on the table, solved the problem in an average 
time of 2 minutes. Another group, told to use any object in the room, includ- 
ed it in 7 minutes. The average time for 


ing the five objects on the table, solv i id: 
a third group, told to use any object in the room but not told specifically 
about the objects on the table, was 18 minutes. 


© now know that functional fxedness is a genul ; ‘ 
` : ; i 
can seriously impede the solution of construction problems and we know 


that it is temporary (Adamson and Taylor, 1954). But do we know how 
to eliminate it? Attempts to do so have been planned around hypotheses 
about associations, including verbal associations, aS well as — ne 
tures. The experiments require several groups of subjects bi E 7 Si 
different conditions because it is necessary tO demonstrate, first, that func- 
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tional fixedness is present, and then, that it can be rem 


ine phenomenon that 
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FUNCTIONAL FIXEDNESS: CAUSES AND CURES 


Like Maier, Karl Duncker, of the University of Berlin, also pied 
struction problems around the Gestalt concept of structure or arga h 
His best-known work, published in German in 1935 and laret irt abject 
CDuncker, 1945), reported several experiments illustrating how an os 
can be functionally concealed, or at least made unavailable ia P ilem 
solving, by being included as part of an irrelevant structure. His ps e a 
may be taken as an example. The task is to support three m o ant 
door at eye level, ostensibly for an experiment on vision. On a ta . cael 
objects, including a few tacks and three small boxes of different ae ety 
colors. The correct solution is to tack the boxes on the door as platforms 
the candles. nie aes 

The experimental variable in Duncker's experiments was the a Ti 
rangement of the materials. For an experimental group of seven al % For 
three boxes were filled with candles, tacks, and matches, se. hp dias 
the control group of seven subjects, they were empty. The a Wes h 
perception, or cognition, of the boxes used as containers would cp Lae 
using them as platforms. All of the control subjects solved the pro E with 
only three of the experimental subjects. Similar results were ee W 
several other problems of this kind, and Duncker’s BE A T 
heterogeneous functional fixedness. Cognition of an object used nA Ai 
function fixes it to that function (or means-end relation) and hind a 
cognition for another function, a special case of embedding. This penon hat 
tion could also be stated as the establishment of a set for one arn 
hinders another cognition if the emphasis is placed on the cognitive ra 
than the executive side of the set. aa he, itt 

In Duncker’s experiments the differences between groups was oe, sae 
they perceived; but others have introduced modifications to test the e : Ar 
what they learn. The independent variable is preutilization of an _ = bal 
the transfer paradigm, helpful in designing experiments on motor ang. ane 
learning, is helpful also in visualizing the sequence of events in dao A 
functional-fixedness experiments. Task A consists, for example, of ust das 
box as a container, and Task B consists of a problem. If the box is use ee 
the solution to the problem more often by those who have had Task A c= 
by control group, the outcome can be called positive transfer. If the bo 
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subjects try the Maier two-string problem using either a relay or a ee 
as a weight for the pendulum. The test was not whether the subjects wou 
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equally often to solve the problem under standard conditions. The preutiliza- 
tion consisted of using either the relay or the switch in constructing a simple 
electrical circuit. In solving the pendulum problem later, those who had used 
the relay now used the switch, and most of those who had used the switch 
now used the relay (see Table 4.5). In general, 17 out of 19 subjects avoided 
the object that they had just used for a different purpose and made use of 
the object not previously used. Other experiments by similar methods have 
also confirmed such functional fixedness. 

Research on other problem-solving processes, especially sets, has stressed 
the importance of instructions, and in one of Maier's experiments the in- 
structions directed the subjects’ attention toward uses of an object that would 
otherwise be overlooked. Instructions were also found to influence solution 
of the two-string problem, by limiting or extending the range of objects to 
be considered (Battersby, Teuber, and Bender, 1953). One group of subjects, 
told to use only the five objects on the table, solved the problem in an average 
time of 2 minutes. Another group, told to use any object in the room, includ- 
ing the five objects on the table, solved it in 7 minutes. The average time for 
a third group, told to use any object in the room but not told specifically 
about the objects on the table, was 18 minutes. 

We now know that functional fixedness is a genuine phenomenon that 
can seriously impede the solution of construction problems and we know 
that it is temporary (Adamson and Taylor, 1954). But do we know how 
to eliminate it? Attempts to do so have been planned around hypotheses 
about associations, including verbal associations, as well as cognitive struc- 
tures. The experiments require severa ups of subjects working under 
different conditions because it is necessary tO demonstrate, first, that func- 
tional fixedness is present, and then, that it can be removed. 
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TABLE 4.5. FREQUENCY OF USE OF TWO OBJECTS 
FOR SOLVING THE TWO-STRING 
PROBLEM BY TWO GROUPS 


Nature of Preutilization 


Test 

Object Relay in Switch in 
sed Circuit Circuit 

Switch 10 2 

Relay 0 7 


From Birch and Rabinowitz (1951)- 
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conditions. In the control group, with a switch and a relay in view, as = 
a few useless objects as decoys, 10 used the switch and 14 used the A w 
as a pendulum weight. One experimental group, Ey, used the smite a 
its conventional function, to make a small light bulb in a simple ko 
circuit go on and off, after which they did the two-string problem. an y 
3 then used the switch as a pendulum weight and 21 used the relay, thus 
demonstrating functional fixedness clearly. , _— 

But the experimenters did not stop at this point. They gave four other i 
varying amounts of experience with the switch used in novel ae 
such as a container for holding pins and a straight edge for drawing a ie 
Different groups (see Table 4.6) got two, three, four, and five uses, one oy 
which was the conventional use, by working problems for which these were 
the easy solutions—though a few hints were necessary for half the sabec 
Groups that used the switch in four or five functions did not become fixated; 
they used the switch as a pendulum weight just as often as the control PoP. 
In fact, as the authors pointed out, Table 4.6 shows a high inverse corre "i 
tion between the functional fixedness in a group and the amount of varied 
preproblem experience. tie 

This experiment has been criticized by Yonge (1966), however, on al 
grounds of a hidden order effect. Order effects in experiments of this Est 
had been noticed earlier by Van de Geer (1957), and Yonge suspected that, 
since the familiar use of the switch is easily recognized as such, minor a 
experienced later would not be very impressive. This line of reasoning. s 
to a study of variations in the order in which the familiar use of the switch 
occurs. saed 

Yonge’s experiment, resembling the first in most respects, also ba A 
college women, 23 in each of 5 groups. The control group, with no preutilize ‘ 
tion, used the switch 14 times as the solution to the two-string problem. i 
other group, which had used the switch as a switch in a circuit, used it for chs 
two-string problem only 6 times (see Table 4.7). These results are the 
expected evidence of functional fixedness. Yonge's other groups had 5 id 
of the switch, like E; of Flavell, Cooper, and Loiselle, but the order so 
varied. Those who got the familiar use of the switch first, followed by 
TABLE 4.6. FREQUENCY oF USE OF TWO OBJEC] 
FOR SOLVING THE TWO-STRING 


PROBLEM BY SIX GROUPS 

Test Preutilization Groups 
Object 

Used C E, E, E, E, É- 
Switch 10 3 8 9 13 12 
Relay 4 210 16 e i Ë 


From Flavell, Cooper, and Loiselle (1958). 
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TABLE 4.7. FREQUENCY OF USE 7 E 
I USE OF TWO OBJECTS FOR SOLVING THE TWO-STRING 


PROBLEM BY FIVE GROUPS 


Preutilization of Switch 


Test 
Object Famili ili 

amiliar Familiar Familiar Familiar 
Used Control Use Only Use First Use Third Use Fifth 
Switch 14 6 4 12 10 
Relay 9 17 19 1] 13 


From Yonge (1966). 


other uses of the switch, used the switch only 4 times and the relay 19 times 
hing ee gu es 
g group and the one 
that had only the familiar use of the switch. But those who had the familiar 
ned e w in the third and fifth positions, following the more un- 
These ee the test problems approximately as the control group did. 
aaa apparently negate the assumption that the effects of varied 
nt it a with the switch summate to produce a flexible approach that 
y si nctional fixedness. 
f onges phenomenological interpret 
arst in its conventional function is an 
a in its conventional meaning, unshakeable by subsequent minor uses. 
he same event is less powerful and does not produce functional fixedness 
minor uses have emphasized the multidimensional, or 


Thus Yonge's work does not re diate 
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n the test, showing stron 


ation was that the use of the switch 
unambiguous event that fixes the 


when it comes after the 
multi : : 
> ultifunctional, nature of the switch. 
he be ‘ : 

e beneficial effect of heterogeneous experience but ri 


any sj r 
hy simple calculation of these effects. 


a trend toward the analysis of natural objects— 


a of construction problems—into separate at- 
Je to the analysis of the artificial stimulus 
dimensions may be called func- 
igs, depending on whether one 
h the object or how he perceives 
ts, some functions are, or become, more 
salient than others, more relevant to the present problem than others, and so 
on. Though the details are not altogether clear, the experimental manipula- 
‘Sa do shift the frequency of use of different functions of a single object. 
Tence the analysis must correspond in some way to the subjects’ problem- 


solvi 
ving processes. 


These experiments outline 
Potential tools for the solutior 
tributes or dimensions, comparab 
Patterns of concept research. The separate 
tions, uses, response properties, oF meanit 
Wishes to emphasize what the subject does wit 
it. As in the concept experimen 


i ao FACILITATION OF REORGANIZATION 
ingle words are quickly organized into verbal structures, and alternate 


si z P 
tructures furnish an opportunity to study verbal associations and organizing 
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processes. Here are four words that Judson and Cofer (1956) presented with 
instructions to eliminate the one that is not related to the rest: 


Sailor Soldier Marine Ship 


Two meaningful organizations are possible: a personnel class, to which ship 
does not belong; and a seagoing class, to which soldier does not belong. When 
the words are printed in the above order, 94 percent of the subjects elimin- 
ated ship. But only 44 percent eliminated ship and 55 percent eliminated 
soldier when they were printed in another order. Such results can occur if 
reading the first two words temporarily activates a hicrarchy of responses, 
specifically, a class of verbal responses: 


Sailor Ship Marine Soldier 


Thinking in terms of a hierarchy of uses for an object, one might ask if a 
verbal hierarchy will correspond to the instrumental hierarchy so that one 
can predict from verbal responses to problem solving. Staats (1957) investi- 
gated this question by having college students list uses for objects, including 
a screwdriver, and then attempt the two-string problem, then again list uses 
for a screwdriver. Only a few gave a verbal weight response to the screw- 
driver at first, yet nearly all solved the problem by tying the screwdriver to 
one string as a weight. After the problem had been solved, weight responses 
to the screwdriver were frequent. Even though the verbal response hierarchy 
did not predict use of an object in problem solving, the solving did alter ee 
verbal response hierare hy. i 

Several attempts have been made from time to time to reduce functional 
fixedness by verbal facilitation, but the results have been ambiguous. More 
recently, Glucksberg and Weisberg (1966) tried the trick of labeling the 
objects in view in order to shift their frequencies of use in solving the 
problem. Adapted from Duncker’s candle problem, this presents a candle, a 


book of matches, and a box of tacks on a table, with instructions to affix the 


candle to the wall so that it burns properly. The trick is to rescue the box 


from its function as a container and tack it to the wall, where it has the 
function of a platform to support the candle. Glucksberg (1964) had ob- 
served that blindfolded subjects struggling with such problems were aided 
when accidental contact with the box gave them a cue. Perhaps those sub- 
jects who look right at the box of tacks and do not see the box will be aided 
by a verbal cue. Glucksberg and Weisberg prepared a printed form of the 
candle problem and varied the labeling of the objects (see Fig. 4.13). When 
all four objects were labeled—candle, tacks, box, and quiches —O5 percent 
of the subjects gave the box as their first solution, as compared to 65 percent 
with only the tacks labeled and 54 percent with none of the objects labeled. 
Another experiment displayed the actual objects on a table for three small 
groups, each composed of eight college men, and varied the labels by pro- 
jecting slides resembling Fig. 4.13 on a nearby screen. When all four objects 
were labeled in this way, functional fixedness of the box was almost elimi- 
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OURE 4.13. Printed 
jerm of the candle prob- 
ie From Glucksberg and 
Weisberg (1966). 


nated; but when only the tacks were labeled, and when one of the objects 


was labeled, the functional fixedness persisted. , , 

lt is important to note that labels are discrete. The subject perceives 
the box of tacks as a whole, but he perceives the labels box and tacks as 
Separate labels and thus is more likely to perceive, Or conceptualize, the 
objects they label as separate. We have the same question of interpretation 
here that appears in verbal facilitation of concept attainment. Does the verbal 
label act as a cue to elicit the critical response, OF does it facilitate perceptual 


differentiation? The labeling of all four objects may draw special RN 
to the instrumental value of one of the four, or it may simply distribute at- 
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to four objects rather than three. ; ; 
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be a special case because pliers are actually used with wires more than 
wrenches are, but we know that adults are prone to semantic associations in 
other situations, so these results can perhaps be duplicated with other objects. 


CONCEPTUAL GROUPING 

Duncker’s original view of functional fixedness was a cognitive one, stated 
in terms of the perceived functional properties of objects. Concept gapen 
ments have demonstrated how the various properties of multidimensional 
objects can be organized in one category or another, hence Terborg (1968) 
designed conceptual grouping tasks to reduce functional fixedness. The a 
problem was an electrical circuit problem, modeled after Glucksberg’s, the 
critical function being the use of the metal blade of a screwdriver as a con- 
ductor to complete a circuit. According to the hypothesis, practice in grouping 
miscellaneous objects according to their properties as electrical conductors 
would emphasize this function, so that it would be more readily perceived 
as a property of other objects. i 

As listed in Table 4.8, each group of 20 college men had 10 conceptua 
grouping tasks which, in the two control conditions, required them to sort 
small common objects by a nonfunctional attribute, such as length, or a non 
electrical function, such as pounding. The experimental subjects had ty 
training in sorting objects with conducting and insulating propertics, sue 
as paper clips and crayons. Although the results were not as clear-cut statisti- 
cally as they appear in Table 4.8, because of the usual variability of onan 
times, the experiment does suggest a rational method for reducing functiona 
fixedness. 


i : . nc- 
Just as there are several ways of describing the essential difficulty of fu 


TABLE 4.8. EFFECTS OF CONCEPTUAL GROUPING ON 
SOLUTION TIME (IN MINUTES) ON CIRCUIT 


PROBLEM 

Group Training Task Solution Time 

Cc, Attribute grouping, 5.23 
nonfunctional 

ey Functional grouping, 5.41 
nonelectrical 

E, Functional grouping, 3.38 
2 electrical functions 

Es Functional grouping, 2.74 
4 electrical functions 

E; Functional grouping, 2:75 


4 electrical functions, 
1 with screwdriver 


From Terborg (1968). 
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tional fixedness problems, there are several ways of overcoming it. But we 
should note that we are discussing a special difficulty and its removal. None 
of the training procedures outlined raise the level of problem solving above 
that of control groups not held down by functional fixedness. 


FROM ASSOCIATION TO STRUCTURE 

Complex structures of knowledge and behavior are not the property of 
A cognitive structure is an explanation for 
henomenon in its own right, to be explained. 
a structure is that once the elements are 
Mandler (1962) has collected considerable 
experimental evidence to show how such a structure may be constructed 
from associations and may be recalled and transferred to new situations as a 
whole. Although he maintains that some behavioral phenomena defy current 
associationistic interpretation, an extract from his discussion is quoted below 
to illustrate what this kind of interpretation may contribute to our under- 


standing of problem solving. 


any one system of psychology. 
certain phenomena and also a p 
The most important property of 
integrated, it functions as a unit. 


Once a response sequence has been integrated and acts as a unit, it develops a 
structural representation, a “central” analogue of this new response unit which 
can function independently of the overt response sequence. . » - Analogic struc- 
tures permit covert trial and error behavior, i.e cognitive manipulation of 
previously established behavior. In this sense, the analogic representation of 
zi F “ * . $ 
a prior behavior sequence is one possible hypothesis to be applied to a par 
ticular situation. Given many such structures which are relevant to a situational 
input, the several structures ‘will occur seriatim and covertly until an appropriate 
one is expressed behaviorally. If it is incorrect, a new structure miy aran us 
giving the appearance of an entirely discontinuous process, of shifting from 
si eae ” . . + a hs 
hypothesis to hypothesis. We say “appearance of discontinuity because a con 
tinuous process prior to the appearance of cognitive structures 1n fact gives rise 
to them. Structures are developed on the basis of associationist stimulus-response 
relationships but, once established, enable the organism to behave “cognitively 
[pp. 417-418]. 
e units, the switch-light problem offers 
how one attains patterns, higher units, 
f separately acquired associations be- 
these to solve problems. G. A. Davis 
n order to compare covert and overt 


Since switches and lights are discret 
an opportunity to study in some detail 
or substructures by the integration 0 
ome switches and lights and oa use 

1967) chose the switch-light pro Jem i ' 
trial a one pio to Davis, if the nature of the problem permits the 
subject to predict the results of his moves, he solves it by covert trial and 
error. If, however, he cannot predict the results of his ee he must pro- 
ceed by overt trial and error—and thus his problem-solving attempts can 
be more easily observed. Davis therefore presented his subjects with a display 
of 12 lights and a panel of switches; various problems involving the relation 
between switches and lights were devised. The one illustrated in Fig. 4.14, 


for gaaracle, raided lighting 2 and only 2 pairs of lights; but while some 
switches were relevant to the problem, some WETS cpr r (they turned on 
lights other than the target pairs) and others were distracting (turning on 


FIGURE 4.14. Example of a problem involving 
10 switches. Three switches are relevant (nos. 3, 
6, 10); two are distracting (2, 8); five are irrele- 
vant (1, 4, 5, 7, 9). From G. A. Davis (1967). 


only one of a target pair). As one might expect, the difficulty of a problem 
increased with an increase in the number of available switches, both relevant 
and irrelevant, and especially with an increase in the number of distracting 
switches. Even one distracting switch greatly retarded acquisition of the 
structure of relations between the switches and lights. , 

Davis gave one group of subjects special pretraining on the correct associ- 
ations between each switch and a pair of lights; these subjects, in comparison 
with others, then proceeded more slowly and made fewer errors. According 
to the experimenter’s interpretation, they could now predict the results of 
their moves, hence they solved by covert trial and error. There may be other 
interpretations for the decrease in errors, but usually a decrease in time 
accompanies a decrease in errors. Reversing the usual correlation between 
dependent measures points to a difference in problem-solving activities be- 
tween groups. We might say, in the language of the quotation from Mandler, 
that the trained group could proceed cognitively because they knew the 
functional relations between switches and lights, and we might guess that 
they could represent these substructures graphically or verbally if necessary- 

In general, we do not deny the importance of a cognitive organization when 
we admit that it was acquired by uncontrolled prior learning, Or recent 
preutilization, or instructions that draw attention to new possibilities for 
organization. In the same sense, we do not deny the importance of prior 
learning when we admit that the crucial learning for solving problems }§ 
often that of grasping a cognitive structure. l 


The Judgment Process in Problem Solving 


Psychologists who have analyzed their own problem solving and those 
who have observed others solving problems have usually included a process 
of evaluation or judgment of the solution. The thinker judges the solution 
acceptable and quits work, or he rejects it and recycles the problem-solving 
machinery. Thus, judgment of the solution is the last operation if it is af- 
firmative; but judgment also takes place throughout the course of problem 
solving, as in deciding on one of several alternative plans of attack. Under 
productive thinking, Chapter 7 will consider problems for which multiple 
solutions are produced so that solution judgment can be conveniently studied. 
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The problems of the present chapter are supposed to have one correct solution, 
easily recognized as such, hence judgment goes quickly, and the judging 
process is easily overlooked. But the involvement of judgment in the solution 
of anagrams has attracted some attention since this operation, like so many 
others, is influenced by word frequency. 

In a multi-stage procedure, the subject tries out combinations of two or 
three letters and matches these with the words that he knows. Presumably, 
words of high frequency are more available for comparison. Whether a word 
is found to match the solution attempt, or no match found and the attempt 
abandoned, the comparison would be easier if the word is familiar. This 
prediction was confirmed for a representative sample of five-letter words by 
printing anagrams beside attempts at solution, for example, garus sug ____, 
ing college students to differentiate the true starts from the false ones 
h 1967). The solution attempts were evaluated 
r for anagrams with solution words of high 
Evidently, the familiarity of the high- 
f attempts at solution and the last 


and as 
Johnson, Lynch, and Ramsay, 
with more speed and less erro 
frequency than of low frequency: 
frequency words aids both the first stage 0! 
Stage of evaluating these attempts- 
An ingenious method devised by 


Bremer (1968) for studying judgment as 
well as production of solutions to word problems assigned a special role to the 


judgment process. The problem was to make a — enen word nee 
and final letters, such as T — — K. Some abnt Pa pea ia 
words of high frequency, such as TOOK, others ~~. i fa a 
as TUSK, and the subjects, who were college _ f bt i te tlie specified 
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solution, The subject could produce 2 nied ae (hen sele one of the 
Could produce several words of varons shim th a limited number of 
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Problem-Solving Processes 
from Computer Simulation 


We might be tempted to dismiss computer simulation of problem solving 
as an idiographic enterprise of no general scientific significance. Why take 
the time to simulate in detail the particulars of the problem-solving output of 
one subject? Are there no generalizations to show for two decades of these 
digital exercises? We must admit that no new psychological principles have 
been advanced, nor have any of the old ones been repudiated, by such work. 
But it is instructive at this time to see what operations computer model 
builders have had to assume in order to approach the simulation endeavor. It 
is easy to envisage how machines can store symbols in the state of an electronic 
circuit or on a tape to simulate memory, and retrieve what has been stored 
to simulate recall. Recognition can be simulated by a systematic scanning 
and matching of what has been stored against new imput, after both have 
been converted to the same system of symbols. Combining operations, so that 
the output of one is the input for the next, runs off easily. There are a few 
characteristic features of computer programs, however, that deserve special 
mention as showing what has been done so far to simulate the flexibility of 
human thinking, 

It is often possible, after a problem has been explicitly stated, to proceed 
logically, though perhaps at considerable length, to a solution, and computers 
are very good at such monotonous tasks. But most people and most simulation 
programs, instead of systematically exhausting all possibilities, try various 
tricks or strategies, called heuristics, to short-cut the process. The concept 
problem illustrates this well. Given all positive and negative instances and 
complete memory, a direct solution is possible. But we have seen that the 
human problem solver, with 
he gets, prefers to formul 
and formulate another, 


a limtied capacity for processing the information 
ate a hypothesis, accept it, and stop; or rejeo i 
accept the second one, and stop; or reject it and 
formulate another; and so on. Most simulation programs, therefore, include 
a matching of the outcome of a preceding operation with a specified gral or 
subgoal, and the direction of the 


h sional on 
next branch of the program is conditional or 
this matching. In simulating cor 


ncept attainment, for example, the program 
might select “angular” as a criterial attribute and test this against the next 
instance. If the match is correct, the program retains this hypothesis and 
proceeds to the next operation. If not, the computer routine that selects an at- 
tribute for testing recycles with another attribute. The overall flexibility of the 
program and the similarity to human thinking comes from the ability of the 
program to perform such an operation and make subsequent operations con- 
ditional on the matching of the outcome with a desired state of affairs, and to 
recycle the operation as often as necessary. This conditional branching and 


recycling can also be included as a subroutine in a larger routine that is itself 
recycled as necessary. 
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To illustrate these processes as they might occur in simulation of the 
Goodnow, and Austin, Reitman 
a computer program and a human 
rd is a positive instance. Note 


Wholistic strategy described by Bruner, 
C1965) has set down the seven steps that 
subject might perform, assuming that the first ca 
that Reitman has italicized the information-processing operations. 


L:— Rie i o 
Remember as hypothesis the value of each attribute. 


2.—If there is a next instance, get it; if not, stop. (This is called a conditional 
5 branch.) ` 
3.—Compare this instance against the hypothesis to determine whether it is 


difference. 


identical, and note and remember any 
if it is not, do the next step 


+.—IF the instance is identical, do step 2 again; 
Cconditional branch). 

—Get feedback indicating whet 
cept. 

6.—Tf answer is “no,” do step 2 again; if “ye 

7.—Forget any values of the previous hypot y 
positive infirming instance. Do step | again w1 


is an instance of the con- 


wn 


her the card actually 


s,” do next step (conditional branch.) 
hesis which differ from those of this 
th this new hypothesis [p. 66]. 
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FIGURE 4.15. Flow chart 
current working hypothesis 
E. S. Johnson (1964). 


for a simple information-processing model. H stands for the 
; p for probability; a is a parameter of the model. From 


tionary prepared for the computer to consult when needed, fair success was 
attained, and the program seemed to perform as human subjects perform. 
Some subjects, however, used auxiliary cues from their own experience, which 
the computer, with its carefully T 

In attempting to reduce this di 


human performance, Reitman (4 


programmed experience, could not simulate. 
fference between computer performance and 
} 1965) was impressed by the long-term per 
sistence of human thinkers in spite of distractions from the telephone, it 
relevant ideas that spring from nowhere, and internal anxieties; so he built a 
program, called Argus, that would operate with a gradual loss of information 
and with variable control over the cognitive elements of the system. He says: 


gnitive processes go on at once, that all information 
y away unless special circumstances intervene, ani 
control in thinking varies from almost none, in free 
egree in some individuals in some forms of goal-directed 


that centralized sequential 
association, to a high d 
problem solving [p. 225 


Argus is capable of solving simple analogy problems in word form, such as 


bear : pig :: chair : (foot, table, coffee, strawberry ) 
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as men do, but some analogies cause difficulties because human subjects pos- 
Sess all kinds of information about words—not altogether complete or correct— 
which influences their thinking but not the computer's. What is needed next, 
perording to Reitman, is a model which, when exposed to a casual information 
E will store portions of that information, as harried human beings 

o, and retrieve some of it, in the peculiar ways that men do, when faced with 


an unanticipated problem. 
Not only verbal processes but perceptual processes also impose a structure 


an the problem materials that appears to offer difficulties for computer simul- 
ation, When expert chess players look at the positions of two dozen pieces on 
a board, they quickly perceive functional groupings: pieces vulnerable to 
attack from other pieces, pieces that can attack others, pieces that defend 


others, and so on. Highly proficient players with years of experience have dis- 


played an almost unbelievable capacity to reproduce the positions of the 


ei after only a brief exposure of the board—but only if the pieces are 
isposed in meaningful patterns, OF substructures, not if they are disposed 


at r à 
t random (De Groot, 1965). “il iy i 
aded so that eye movements 


Fortunately, a chessboard is spatially exter 
around the board can be photographed and fixations located with moderate 


accuracy. Simon and Barenfeld (1969) therefore adapted their information- 
;mulation of eye movements by 


processi A 

processing model of problem solving to the si a 
ial i : : z 
aking a few reasonable assumptions about the information, on the functional 


relations between pieces obtained in one fixation and the next fixation implied 
A 


by this information. Some fragmentary comparisons with tracings of actual 
eye movements show fair agreements this suggests that an egg 
ing model can account for the ganization of the problem 


preparatory or 
Materials that occurs in the first few seconds of exposure. 


Main Trends and Implications 
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demands. The familiarity of the materials of the problem has a ae 
helpful effect, but familiarity with potential solutions, from paposite ss F 
past or present, has a much greater effect, facilitating judgment as well as 
solutions. ry i 

The old attempt at analysis of the thought processes by examining a 
own thoughts or by observing other people solving problems has been broug n 
up to date by exposure devices that allow more complete E 
timing of moves. The trend now is toward more objective methods, w hich 
necessitate choosing problems that are convenient for objective observation 
Coften very artificial), and toward formulating and testing specific hypotheses 
about observable behavior. The aims of modern analyses are more modest 
than a description of the thought processes; the proce and effects ex- 
amined are limited both in number and magnitude. 


The associative mechanisms developed to extend theories of simple learn- 
ing into the problem-solving domain lean heavily on the hypothesis of a 
response hierarchy, which has been quite successful in accounting for the 
effect of word frequency in anagram solution. Another associative mechanism, 
mediating responses intended to integrate separately learned behavior seg 
ments, has had only limited success as yet. 

Demonstrations of the 
problem-solving set 


control of problem solving, for better or worse, by a 
are as old as this century, the interpretations of Ta 
demonstrations varying with the theoretical fashions of the times. The see 
situation and the general instructions are the basic conditions that activate 
the general set or adjustment to a problem-solving assignment. Given on 
general set, a more specific set for a specific method can be quickly established 
by a short series of problems that are readily solved by such a method—or 
even more quickly by specific instructions. Single words and their associations 
exert a semantic influence while the instructions exert a syntactic influence. 
Some of the effects of sets can be interpreted by a 


ciation hypotheses derived 
from animal le 


arning or from free recall. The similarities between the sin oe 
experiment and the concept experiment are also of theoretical interest an ; 
as in concept learning, some evidence for discontinuity under certain ee 
ditions has been presented. But evidence for a continuous, cumulative learning 
curve obtained by uncritical pooling of data from different subjects is ee els 

An information-secking process can be isolated by arrangements Whien 
minimize the other processes and permit the subject to uncover the acme | 
information in his own way. Most ordinary adults improve somewhat by a ; 
justing their Strategies to their experience with the work required and the 
probabilities of finding the information. But 
learning, they seldom discove 
with it. 


Unlike the jigsaw puzzle that may be solved by manipulating the li 
directly, many problems are solved by representing the data symbolically an 
manipulating the symbols. Transforming the information from one system tO 
another while preserving the structure is necessary for solving some problems, 
but these processes have not received the experimental and theoretical at- 


i i ability 
» as in experiments on probabil 
i i te tick 

r the mathematically optimal strategy and s 
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tention from students of problem solving that they deserve. Obviously, 
errors can occur in the coding of data into symbols, in the manipulation of 
the symbols, and in the application of the symbolic solution to the original 
problem situation. 

The cognitive structures that are most useful in solving problems are—in 
addition to perceived structures of objects—behavioral structures, or plans, 
whose elements are simple acts. The understanding of a structure is some- 
thing more than the ability to perform the elementary acts, and methods are 
available for differentiating achievement of the structure from achievement 
of the elements. Principles, which make general assertions, are particularly 
ns are solved by application of a 


useful for problem solving: some probler 
principle and some by the mere statement of one. These measures and others 


can be used to test achievement of a principle, and the converging results of 
several measures are more convincing than the results of one alone. Sequences 
in the development of principles and other cognitive structures parallel those 


in the development of concepts. 
> 
But problem solving does not st 


call problems stimuli, but they are ofte 
an others. 


art from nowhere. Speaking loosely, we can 
n complex stimulus patterns, some of 
Anagrams in the form of words are 
e than comparable anagrams in non- 
arrangement of letters are 


whic > i 
vhich resist analysis more th 
more difficult to break up and restructur 


word form, And anagrams that require much re: ar 
i ends of the structure are preserved. Hence it is 


especi va 
Specially difficult, unless the 3 S f 
reasonable to say that problem solving consists of reorganization as well as 
Organization. | Reieaeth 
, A N pe , sived as a characteristic 0 the per- 
Functional fixedness was originally conceived as are ; I 
the embedding of the critical function of a com- 
ure—and investigated by manipulating the initial 
Iso been conceived as a consequence of 
Previous learning of the components and investigated KA men Sean pre- 
utilization of the components. Functional fixedness interferes with solution, 
as does the problem-solving set under some conditions; but, to go one step 
further. k ; f y ani bé reduced by manipulation of the perceived 
, such interference car > ; $ 
situation, by more varied preutilization, by strategic labeling of the compo 
nents, ¢ j ‘ anting tasks: 
S, ar > practice prting tasks 4 s 
nd by practice on SO he trend among several investigators, 
Equally important for the future om bj tas aultidimenst al 
Pranching out from concept research, to treat an object a n : a a 5 
as if the think enale oe various dimensions, ened ae and other 
ker analyzes : asize si 
attributes, Just as the cognition of a large red oe 2s EE aE need 
calar or inne a ma P nition of the other attri utes temporarily more 
difen hape, making F, function of an object make cognition of other 
ult, so does cognition of one fi 
u ~ti D ° 
oe one temporarily more difficult. ante treated as hypothetical medi- 
chavioral structures, like concepts, © LSE 
s ? : i reted by principles of cog- 
ators, with either cognitive proper" interp pl ae neat we 
ither cogr <- to be interpreted by principles of 
h and argument have not 
vo interpretations. The 


Ceived si P 

ived situation—namely, 

bone: : = 

Ponent in a larger struct 

arre 5 oR 4 ae 
rangement of the materials. It has a 
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theoretical stalemate suggests that hypothetical properties of hypothetical 
mediators are not distinguishable in fact or in theory. There is a clear trend, 
however, toward mediators with structural properties, whether they are called 
stimulus-response chains, conceptual systems, cognitive schemas, or sub- 
routines. The most human characteristic of programs for computer simulation 
of problem solving is their ability to organize elementary moves into these 
subroutines and integrate them as units, dependent on the outcomes of 
previous moves, into the larger movement toward a solution. 
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CHAPTER 5 


THE SOLUTION 
OF PROBLEMS 
CONTINUED 


Training in Solving Problems 
Individual Differences in Problem Solving 
Trends and Implications 


Major 


veryone knows that the solution of a prob- 
lem may be a complicated enterprise. We 
have related how this depends on various 
s—such as associations, informa- 
d principles—some of which are 
ements of previous learning and thinking, while 
on the spot during solution of the 
so on the appropriate 
hese components into the larger structure 
Jem, often in spite of an initially resistant 
such system of activity, composed of 


l EIN component 
tion, skills, concepts, an 
the achiev 
others are achieved 
present problem. Solution depends al 
organization of t 


that fits the prob 


organization. In any : 
so many interacting processes, there are multiple possibili- 
x there are many points where 


ent because 
be inefficient, and many dimensions in 


vers differ from one another. The two 
training in solving prob- 
among problem solvers. 


ties for improve™ 
performance can 
which problem so 
main topics of thi 
lems, and individual differences 


Training in Solying Problems 
A out difficulties in prob. 
Some generaliz ns abo one ene p T 
solving and how they are OY erc glea rom hun- 
Seeds al experime nts) were set forth in the previous 
thing to establish generalizations 


chapter. But it is one S 
P er and another thing to teach peo- 


about problen 
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ple how to do it. A theory of learning, or thinking, should not masquerade 
as a theory of instruction. 

The controversies over methods of instruction are somewhat similar in shape 
and intensity, nevertheless, to controversies over theories of learning. And 
one of the most persistent of these, which may be phrased as rote Jearning 
versus meaningful learning, or drill versus understanding, applies with equal 
emphasis to problem solving. A second controversy, overlapping the first, is 
that between discovery and expository learning. Is it better to let a student 
make errors and discover the solution by himself or to explain to him how to 
solve a problem? This is an active topic of discussion even though there is 
some uncertainty about the meaning of discovery in this context (Shulman 
and Keislar, 1966). But one thing is clear. There is no magic formula for 
teaching people how to solve problems. Research will not prove that one 
method is right and the others wrong. Progress is more likely to come from 
a better understanding of the effects of alternative methods and their com- 
binations. À 

Any comparison of instructional methods must consider the criterion of 
achievement, since different criteria suggest different conclusions. The cri- 
terion may be a reduction in solution time or in number of errors when solv- 
ing a series of problems. More important, however, is the transfer to other 
problems, either immediately or later. The transfer problems may be similar 
to the training problems, but if we are concerned about remote transfer or 
general problem-solving skill, the transfer problems should be different. 
Some investigators have included the criterion of interest in solving problems, 
because it is claimed that some methods are boring while others are fun. 


TEACHING CONCEPTS 


Concepts are necessary equipment for the problem solver because certai! 


problems are stated in general terms. These terms may refer to class concepts 
such as deciduous tre i 


and uninsured drivers, or to dimensional concep 
such as acidity and resistance. In other cases, solution depends on knowledge 
of a concept that does not appear in the statement of the problem. There 
the concept is a mediator. Solution may depend, for example, on arranging the 
elements of the problem in the form of a right triangle and knowing the 
properties of right triangles. In any case, achievement of the concepts involve 

is necessary, and the question arises how they should be taught to facilitate 
their use in the solution of problems. 

Conventional concept research is not directed toward this question. Look- 
ing at the method of the concept experiments outlined in previous chapters 
from the teaching standpoint, we would describe it as learning by discovery: 
The experimenter does not usually tell the subject in advance that the class 
consists of large squares with double borders; he merely confirms correct 
guesses, expecting the subject to discover the class by himself. There are so™° 
instructional implications, however. It is important to note that the genera 
set communicated by the experimenter, which is a form of instruction, has ’ 
strong effect. Few subjects discover subtle concepts in laboratory experiments 
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without being told to look for them. It is pertinent also that negative infor- 
mation, telling what a concept is not, is less helpful than positive information 
and that this difference can be reduced by special training. 
i ase ie ied Lee ges os at i course of this inductive type 

; a g a ply tell the subjects what the concept is, 
that is, what we mean by dax or deciduous tree or acidity. But we cannot 
assume without data that this method is better than others. College students 
given definitions of four concepts Cpage 82) did well on later tests of con- 
cept achievement but no better than subjects who learned the same concepts 
by other methods. The possibility of interference between concepts in this 
simple experiment reminds us that the greater likelihood of interference in 
school, on the job, and in the library, may shift the relative weights of factors 


found in the laboratory. One would expect, however, that the superiority 


of the mixed program—combining four methods of teaching concepts—would 


be generally applicable. 


TEACHING PRINCIPLES 
Many teaching procedures combine the teach 
But since principles have a special place in problem solving, their teaching 
ace in discussions of training for problem solving. Is it 
les for complex performances 
and solving an algebra 
themselves? What would 


ing of concepts and principles. 


also has a special pl 
more efficient to teach students the basic princip 
—such as making a cake, wiring a television set, 
problem—or to let them discover the principles by 
be the optimal combination of problems for teaching principles? 

One of the earliest experiments On this question used the disk-transfer 
problem later used by Cook (see page 139). Ewert and Lambert (1932) gave 
s of instructions, but the most efficient perfor- 


their sub} ; 4 
heir subjects various kind } 
ho were given a statement of the principle for 


mance was attained by those wW 
such efficient performance. 
Another classic series of experimen 
Parison of organizing and memorizing, 
complex performances, such as card tricl 
tudents W 


Katona (1940) called a com- 


ts, which 
required college students to master 


< and matchstick puzzles. In one of 
as given the exact order of arranging 
ized by repetition. Another 


these exper 

sy experiments, a group of s 
he cards to perform a trick, which they memor 
group observed a demonstration and explanation of the principle of the 


tick, given only once, and it was this group that was most surgani a 
Performing the tricks later. A group that received a verbal statement ot the 
other two- 


Principle was intermediate between the 

A number of studies in the 1950s followed up these early $ 
use of Katona’s problems, disk-transfer problems, arithmetic problems, word 
Problems, and others. One by Craig (1956), with the familiar oddity problem, 
Sives a good illustration. Each of his problems consisted of five words, four 
to be related by a principle, the other to be identified as the odd one. The 
Principles for different blocks of problems were based on similarities of sound, 
Spelling, or meaning; but one group of students was told only that there 


wi rae ; à 
ere principles, while another group cou 


tudies, making 


ld read the relevant principle 
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printed above each block of problems. This latter group did better on n 
immediate transfer test of new problems involving the same principles anc 
also on another test a month later. They were not superior in learning 
different principles for different types of problems however. A 

It appears, in general, that there is much to be said for teaching prin- 
ciples, especially when the problems are difficult and the criterion is the 
amount of transfer to new problems. But generalizations are hazardous 
because, as Duncan (1959) pointed out, after reviewing many of these 
studies, “Anything one learns probably has both positive and neg 
transferring effects, depending on the situation [p. 401].” These effects may 
vary with the specifics of the teaching procedures and materials. Recent 
investigations have looked more critically at these specifics. 


ative 


MATHEMATICS ; 
The concepts of mathematics, strictly speaking, are few. Most of thë 

learning consists of the acquisition of mathematical relations, principles. 

rules, and operations for carrying out mathematical manipulations that are 


A $ ' f 
learned and recalled as higher-order units. Recently, the development © 
teaching machines and the 


Me jals has 
demand for efficient programmed materials h 
accelerated the interest in o 


ptimal methods for teaching such units and their 
use in problem solving. Gagné and Brown (1961), therefore, tried out three 
programs for teaching high-school boys to derive general formulas for me 
sum of a number series, such as 1,2; 4, 8, 16, —, —. If 2 represents the 
position of a term in the series and T the value of a term, it is casy to m 
that when m is 6, T is 32. But the problem is to find the sum of a series d 
n terms, the formula for which in this example is T,,,;—1. All students had 
the same introductory program, after which three groups had different pro” 
grams designed to teach four such formulas. The riule-and-example program 
gave each student the correct formula and required him to apply it to a few 
examples, then another formula and a few more examples, and so on, procee i 
ing by small steps. The discovery program asked each student to try to pe 
the formula by himself, giving hints as necessary. The guided discovery PY? 
gram asked each student what was necessary to get the sum, whether adding 
or subtracting would be helpful, and so on, and later asked for a genera 
tule. The next day each student did cach problem again, as before. Then 
all students did four new problems, getting hints as necessary, to determine 


Pt ‘ aad w 
how well each type of training would transfer to the solution of similar ne 
problems. 


The results were quite clea 


z j jas 
r-cut. The method of guided discovery W 
most effective; with this metho 


d, the mean time for the four new problems 
was 16 minutes. Mean time for the discovery method was 20 minutes, an 
for the rule-and-example method 27 minutes. Certainly, other programs: 
better or worse than these, could be designed under the same three labels. 
But by looking closely at what each group of subjects did, Gagné and Brow? 
concluded that the superiority of guided discovery in their experiment = 
sisted in the use of previously learned concepts, as represented by such 


uw 
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symbols as n, T and T}, in new contexts. The discovery program made 
use of the same concepts, but systematic practice and guidance in their use 
was not given. In the rule-and-example program such symbols occurred as 
stimuli, in the formula provided, but the responses to these consisted merely 
of the writing of numbers. In comparing methods, we should look not only 
at the names but also at what the student learns under each method, whether, 
as in this case, he learns to substitute numbers for symbols or to manipulate 
concepts. 

Mistakes can easily be made in arranging the materials for teaching 
Principles. One characteristic to be watched is the heterogeneity of problems 
(Traub, 1966). A group of sixth-grade pupils, given a homogeneous series 
of problems in graphical addition of integers, acquired the necessary prin- 
ciples, in a Ütited way, and solved new problems better than a control group. 
But they skipped the harder problems and they made stereotyped errors. 
Another group, given a heterogeneous series of problems with more varia- 
tion in numbers and algebraic signs, spent more time on the training series, 
but they did much R on application to the new problems than the other 


Stoups. 

There is no magic in learning by discovery. If it is superior, why is this 

so? One possibility, put forth by Berlyne (1965), is that the discovery method 

causes cognitive conflict and thereby increases motivation for understanding 

the problem. In a study by Worthen (1968), however, use of the dis- 
S ms aes SS fference in the subjects’ general 


a noticeable di 
a — vad Scandura (1968), on the basis 
ed to several combinations of methods, 


nciples by the discovery method learned, 

Not only ar Iso something about deriving principles 
y the relevant principles, but als ee 

that can apply to eee tesis. ere given the rule had no 


Speciz nab 
Pecial reason to learn how to derive it for t 
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les. Di Vesta and Walls 

vi les for rearrang- 
(196 lysi by givi n ru g 
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Serie: . 
s and those on the transfer se hen the solution rules acquired 


tion 4 , ccur 
s, that the most improvement ol ra hase neede d fog the transfer 
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ANAGRAMS 
™ e noted (in the preceding chapter) 
the solution of a series of pre Jems, 
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especially in experiments on verbal Jeanni 
clarify the application of P ‘aie 
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from one principle to another. Then what is the optimal sequence of train- 
ing problems for developing flexibility in solving the transfer problems? 
Di Vesta and Walls (1968) tried out several variations in training problems, 
preceding transfer problems that required shifts in the rule for rearranging 
the letters and also in the category of the solution word. Variations in train- 
ing with the letter-order solution rule had surprisingly little effect on the 
transfer problems, but training with anagrams that had multiple categories of 
solution words did aid shifting on the transfer problems. Shifting from a set 
for one category of solution words to another may be considered a kind of 
strategy, and Di Vesta and Walls point out that those who trained on multiple 
categories had more practice on the “win-stay,” “lose-shift” strategy. Despite 
the difference in content, these results agree with Traub’s for arithmetic 
problems in emphasizing the beneficial effects of variations in the training 
series when subjects turn to new problems. 


STATING THE PRINCIPLE 


“If you want to learn something, teach it.” This old truism can be rephrased 
as the hypothesis that learning is more efficient when the learner puts the 
relevant information into words for communication to others, or as the say- 
ing goes, “verbalizes” it. In Chapter 2 we noted that attempts at the verbal 
definition of a concept facilitate the attainment of figure concepts: But 
attempts at the statement of a principle have more complex effects. It is not 
the actual enunciation of the principle that is important, of course, for i 
principle stated by the teacher can easily be memorized and repeated verbatim 
by the student; it is the generalization of the principle by the student on the 
basis of his experience with certain problems that is supposed to facilitate 
application to other problems. 

Ervin (1960) approached this question by training children in the third 
and fourth grades to shoot a ball so that it would ricochet off a backboard and 
strike a target. Some of the children were able to formulate an explanation 
for the path taken by the ball, but others could not, even though their pe“ 
formance was successful. When a transfer task called for aiming a flashlight 
at a mirror to strike a target, the advantage of formulation of the principle 
showed up. But at this age, although language and thought are both fairly 
well developed, they are not well correlated. Ervin pointed out that the 
verbal statements of these children did not correspond very well with their 
problem-solving actions and should not be taken too seriously. “The gre?t 
skill that children acquire in verbal learning is something of a hazard in 
that it is likely to reward both teacher and pupil so readily that the teacher 
overlooks the learning criterion she is most concerned with, which is not 
necessarily verbal learning alone [p. 553].” 

The familiar disk-transfer problem described on page 139 is convenient 
here because the moves are easy ones that can be made without much thought, 
but they are discrete moves that can also be conceptualized and describe 
verbally. Thus the subject can talk about patterns of moves and plans sed 
rules for efficient solution of the problem. He can make some progress by 
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planning one or two moves ahead, but he can make more progress by working 
out a more rational formulation. 

Gagné and Smith (1962) had boys in the ninth and tenth grades work 
this problem with several variations in instructions in order to study the 
contribution of the subjects’ verbal statements to overall efficiency of perform- 
ance. It was the subjects who were encouraged to state why they were mak- 
ing a move at the time they were making it who did best after shifting to new 
problems of the same type, both in respect to minimal number of moves and 
to the adequacy of their statement of a general principle for such problems. 
Similar results have been obtained by other investigators (Davis, Carey, 
Foxman, and Tarr, 1968). 

One can select cards, write numbers, and make moves without much 
intellectual effort and usually make some progress. But if one is expected to 
justify his problem-solving attempts verbally, he has to analyze the problem 
in sufficient depth to give names to the variables in view and the relation- 
ships between them. It is easy to believe that training in producing verbal 
rationales will be beneficial when the criterion of achievement is a verbal 
generalization, but in the experiments mentioned here the nonverbal achieve- 


ments also benefited. 


HIERARCHIES OF KNOWLEDGE 


Consider the following analogy problem. 
5 À i A 5 
laconic is to loquacious as hackneyed is to — 


The solution here can be treated as a high-level performance that integrates 
four low-level components. No one would be expected to ‘get the ener 
if he did not know the meanings of the four words. So pitting eie pa 
aside temporarily and teaching the low-level components would presumably 
be an efficient start. 

The notion of a hierarch 
are necessary for performance: 


y of performances, some of which, at low levels, 

ice at higher levels, is implicit in much ce 
: í i ion of concepts, principles, 

on problem solving. Discussions of the functio: ~ ar h e 

d other substructures generally assume tha 

a larger solution pattern, but 


ned most explicitly by Gagné 


(1962). Fig. 5.1 illustrates a hierarchy of nine pe ces Lee a 
volved in the problem that Gagné and Brown ( = Ao pase 
202): finding a formula for the sum of terms ina ep jes. l 

of each of the nine tasks were prepared and given in a fixed sequence from 
the top down, stopping when a task was reached aa a — 
perform. It was assumed that the subject could perform yo s — nt A 
hierarchy, but that instruction would be necessary for those Ur whic 

had been failed. Thus the subordinate capabilities, = noa aut för 
arithmetic problems, are intellectual skills, not necessari y verbal statements 


of principles. 
The implication of complex task 


cognitive schemas, an eg 
Separate components to be organized into- 


i i xami 
hierarchies of knowledge have been exa 


hierarchies for training is that the subor- 
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Task 


Finding formulas for sum of 
terms ina number series 


I. Supplying symbols and operations 
for general equations between 
numerical quantities having porticular 


spatial relations ina table 


all 


IB. Supplying numbers and 
operations for specific equations 
between numbers having particular 
spatial relations in a table 


i 


IIB. Supplying numbers and 
operations for specific 
equations between 
numerical quantities 


ILA. Using symbols to identify 
spatial relationships between 
numbers in different rows 
and columns of a table 


IXA. Using sym-| | IZA2. Using IZAB. Identify-| | 1ZB1. Supplying 

bols to locate symbols sub- ing spatial operations to 

vertical posi- scripts to lo- patterns of make equalities digits in spec- 
tion and iden- | | cate and iden- symbols and for specific nu- ific state- 
tify numbers tify numbers numbers ina merical state - ments of 

in columns of in rows of table ments numerical 
tables tables equality 


Recog- 
nition of 
patterns 
(closure) 


Symbol 
recog- 
nition 


FIGURE 5.1. Hierarchy of knowledge for the task of finding formulas for the sum o 
n terms in a number series. From Gagné (1962). 


dinate capabilities must be learned before the higher ones. By this procedure, 
the learner does not begin work on the terminal performance. The instructo" 
first analyzes the complex terminal performance into the requisite subordinate 
capabilities and teaches these one level at a time. The learning is cumula- 


tive. The final level is practiced when all the subordinate performances have 
been mastered. 


u 
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Some novel empirical consequences of this hierarchical approach came to 
light in a report by Gagné and Paradise (1961) of an experiment with 118 
students in seventh-grade mathematics classes learning to solve linear equa- 
tions. Task analysis by the investigators identified 22 components located at 
5 levels. In the records of performance at each level before and after a 


teaching program, there were four contingencies for testing the theory: 


Success at one level and also at a higher level. This is predicted from the 
theory. 

Failure at one level and also at a lower level. This is predicted from 
the theory. 

Success at one level and failure at a lower level. This is opposed to the 


theory. 
Failure at one level and success at a lower level. This is not opposed to the 


theory; it just means that the subject did not learn to make the integra- 


tion at the higher level. 


When the data were examined in this way, more t 
stances were predicted from the theory. ; 

Describing a structure of capabilities in a complex problem-solving perfor- 
mance is probably easier than describing the optimal method hat learning the 
structure. The improvement in the Gagné and Paradise snore was 
smaller than expected. Learning from the bottom up is not snes y better 
than learning from the top down; it seems to run contrary to the recom- 


mendation to learn principles rather than facts. 
OF course it is easy to make up a logical hierar 


who make certain college courses prerequisite to others, that students must 
e o on to the next. But this cannot always be 
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chy and assume, like those 


y ina logi 


optimal sequence of study- Jation to loquacious; 
z 5 its relati even 
ie ae, comm ye cir functions in â 


teach the components by teach» 


208 *¥ THE SOLUTION OF PROBLEMS CONTINUED 


evidence suggests that we can err by spending too much time on the low- 
level components, and also by jumping up a level too quickly. 


TEACHING REPRESENTATION AND TRANSFORMATION 

Max Wertheimer’s (1945) well-known book Productive Thinking did 
not contain any empirical evaluation of alternative methods for solving 
problems. His contribution was rather to analyze the difficulty of certain 
arithmetic and geometric problems by describing their structure and to 
suggest methods that would reveal the structure more clearly to the student. 
Although Wertheimer did not emphasize the transformation process, many 
of his suggestions consisted of transforming a symbolic representation of the 
problem into a graphic representation. Consider his hypothesis of the dif- 
ficulty involved in the problem of vertical angles: the assignment is to prove 


that angles a and c of Fig. 5.2 are equal. One can proceed algebraically, as 
follows: 


a + b = 180 degrees 

c + b= 180 degrees 
Subtracting a-—-c=0 
Therefore a=c 


According to Wertheimer, this demonstration is hard to follow. Students may 
learn it mechanically, but the occurrence of mechanical errors in repeating 
the proof shows that it has not been genuinely understood. The principal 
reason for the difficulty is that it is hard to perceive angle b as part of a + ¥ 
which equals 180 degrees, and also as part of b + c, which equals 180 
degrees. It is the structure, including the relation of the parts to the whole, 
that must be grasped rather than any particular equation or sequence of 
equations. Wertheimer recommends graphic representations of the problem, 
as in Fig. 5.3, as successful devices for revealing the structure to the student. 
At the same time, Wertheimer’s discussion includes considerable verbal 
analysis and many transformations of numerical equations into ordinary sen- 
tences, 

Solid objects, or construction materials, permit more flexibility in the 
representation of mathematical and physical principles, and many teachers 
have used such materials for this purpose. But a child’s understanding of the 
principles may be limited to the specific materials or apparatus from which 
the illustration was contrived—as the ancient Egyptian surveyors were 


FIGURE 5.2. The vertical angles problem- 
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FIGURE 5.3. Alternative ways of restructuring the vertical angles problem. After 


Wertheimer (1945). 


limited to specific applications. The teacher's task, of course, is to conduct the 
child beyond specific imaginable examples to general principles. Bruner and 
Kenney (1965) have shown how to represent the multiplication table with 
blocks that children can manipulate in order to multiply and divide percep- 
tually and thus see the equality between (x + 2)? and x? + 4x + 4. An 
appropriate notation for representing the mathematical quantities and rela- 
tions, the heterogeneity of the examples, and considerable explanation in 
ordinary Enalish seem to be the critical ingredients for the effectiveness of 
‘ = e that even after older children 


uch learning. But Bruner and Kenney believ 
the mathematical expressions and a 


have acquired the formal properties of 
Systematic symbolic notation, they still utilize their imagery to explore prob- 
lems and relate them to problems already mastered. 
The apparent advantages of the manual-visual mode of representation 
can easily be combined with those of discovery learning. For example, Ander- 
son and Johnson (1966) gave one group of college girls a short, barebones 
Statement of the forces operating on balls rotating about a standard and 
then asked about its application t° a somewhat similar test situation. Another 
group was told to experiment with the apparatus until they understood it and 
then was asked att the test situation. Both groups gave better explanations of 


tlre test situation than a control group: but the perceptual pen of me 
tion was superior and those who were successful “a tose an 
imager ta z ; ‘ similar ones have obvious 
Magery, The results of this experiment and other a 

£ o: but we do not know whether to 


implicati EE blem solving: 
dions for taimina in pence S m ; 
i between the perceptual and verbal modes of 


attribut : i 
e the erence . i 
em to the diffe discovery and expository learning. 


repres i i between 
ent n erence be . : 
ation or to the diff ive representation systems in more 


Scar 7 ] alternative 
andur svaluated 4 terna i N : : 
lura (1967) evaluate in mathematical operations in ordinary 


detail } nee rtair 
by Ww. os for certa $ s 
y writing the rule : with summation signs, parentheses, 


Enolish «al form 

ish ‘ 3 atical torm, 
s che i ey 3 jee A r j mostly college women, learned 

Xp j f 
‘he rte other a: ae rapi e English ern probably 
J ic statements 7 ic i 

ecause the | ‘ere longer: Mathematical expressions that are used 
atter we S pact symbols, which, though less 


Tepeat “ato more compe i 
Peatedly are often recoded int he information to be retained. 


amiliar i esentations 

are more efficient rep" À vat 

But whe h bj sere faced with applications OEN problems, they 
n these subjects W dily as the rules written in 


appli RE Jish just a5 "9 : 3 
Pplied the rules written bi ‘ eN those who were given symbolic prob- 
nel shi 


M; ž $ 
athematical symbolism. 
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; -3 y had previ- 

lems applied the symbolic rules successfully if, and only if, they had par 

i i `rlyi rammar, 

ously been taught the meaning of the symbols and the underlying gr me 

iði jonihe ; pare ave ‘hese results a 

such as the order of operations signified by parentheses. I Ea e ia 
evidence, in general, of cumulative or hierarchical learning. The hig 

accomplishments depend on the low-level ones 


saiia esentation, rela- 
to benefit from some of the advantages of mathematical representatic 
tive to ordinary English, w. 


ithout realizing all its potentialities. 
There is much still to be learned about 
The graphic mode is most a 
some problems, and for th 
forming data from other fi 


The verbal definition of 


, yet the subjects were able 


alternative modes of representation. 
ppropriate for representing the structure e 
ese instruction in drawing graphs and in trans 
orms into graphic form will no doubt be helpful. 


5 a 
a class of figures and the verbal statement of 

i inci i : : srations, are 
physical principle, which may be treated as transformation operations, 4 


helpful in other cases, Hence we may suggest the hypothesis that, aside from 
the appropriateness of a specific mode of representation, any transformation 
rom one mode—verbal, numerical, algebraic, graphic, gestural, cinematic 

to another affords the student a further avenue for penetration to the struc- 
ture of the problem and increases his depth of understanding. Any transforma 


: 7 ; ation by 
tion that the teacher makes would be instructive, and any transformation by 
the student would presumably be even more helpful. 


TEACHING PROBLEM-SOLVING 

There is some ambiguity 
ciple, rule, and strategy, bu y is a general approach. ed 
in the terminology of concept research, a Strategy is not a method for solu- 
tion of the problem, but a method for collecting and retaining relevant 7 
formation, for formulating and testing hypotheses, for representing ap 
transforming the data, for detecting errors, or, in short, a method for finding 
a method for solution. Thus many 


$ ay be 
a of the procedures outlined above may 1 
“ns ered as the teaching of Strategies when they are taught with genera 
1 a si F . “ate 
rae for a broad class of problems, Examples are the half-split strategy 
or informati ki m p 7 nc 
Hci on seeking, breaking up a problem into subproblems, and tl 
explicit verbal statement of inci 


STRATEGIES 


in the psychological usage of such terms as pr” 
t most often a strateg 


y X situations, the immediate production 
c luti s with the chance: 
identify i 


‘te n © to ask questions, yet many observers 
t have been ™mpressed by the i 
and many teachers would like to 
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ement are in- 


cies most in need of improv 
fo} 
activities are 


ae It is possible that the strate 
ir ie a strategies, especially since these preparatory 
ep Gi ooked. As a first step in what he calls Inquiry Training, Suchman 
aa iee films of a physics demonstration to schoolchildren and ar- 
g he conditions so that they had freedom to ask questions and received 
J hildren’s questions were 


prompt, reasonable answers. Taped records of the c 
| also played back to the children so they 


analyze F i 
es vas by the investigators anc 
uld le; 5 ‘ i 
Š ild learn some of the consequences of the various strategies of data gather- 
, 8: In comparison with control groups, those with Inquiry Training asked 
— questions, and classification of the questions indicated that they were 
oeetags of a higher quality. Questions that merely asked the teacher to 
o " k 7 < G = Š c E) 
ie thinking became less frequent after this type of training, and those 
ed at identifying the parameters of the problems more frequent. 


ti The type of questions asked by children is influenced also by the ques- 
ions ; y by a prestigious model. Rosenthal, Zim- 


displayed pictures © 12 familiar objects 
while an adult experimenter 


he objects (e-8» shape) as 


a ked by others, especiall 
f ; = 

x man, and Durning (1970) 
a group of disadvantaged sixth-grade children 


ash a f 
ked questions about physical attributes of t 


criteri 3 P 
iteria of classification. In other groups the model asked questions about 
about relations between objects, or about their 


the function of the objects, Z quesis aaia 
objects the mare inguin = encouraged s$ p br sha same type 
fee is mier Kemen questions in cath BSP SA" Se 
frequent. hen y ee moda m that group—though exact mimicry was D 
on type of new pies were presented, and the effect of the model 
2 questions asked generalized to these also. The success of such 
lready shown how a model in- 
ior, and the like in slanting in- 
parents and teachers slant the 


a experiments, which have a 
Aa i sae sare? aggressive behav 
child's — ing tendencies, suggests how 
The ee e categorize the objects around him. NO 
daltai yao giving information about strategies and principles were 
hiihedbpel iÀ ne comprehensive experiment by Corman (1957). He had 
reve: ines eines work matchstick problems like those used by Katona and 
Peali En of information—much, some, and none—about a method 
a tilet gt he problems and the same three degrees of information about 
S or solution. The information about the method was helpful in solv- 
pede problems, while the information about the rule was more helpful 
ah ee the students were asked to state a general principle. In this 
ie ie Ae the improvement was specific to the criterion that corresponded 
eae ind of information given, but in some other experiments (e.g., 
fires and Smith, 1962) the benefits of teaching principles were not always 
in this way. 


Can young children be taught to solve problems? Previous chapters have 


n st es B > : : 
a the limitations implied by Piaget's stages of intellectual development 
and some attempts to teach children to overcome these implied limitations. 

An attempt by Anderson (1965) to teach 


M r 

p odeme success was reported. 

vO. * š ps i 5: 
young children rather complex problem-solving skills is more encouraging. 
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He gave first-grade children a variety of complicated problems involving 


concepts and principles, such as determining which kind of cowboy is a 


friend of the sheriff and why one pendulum swings faster than others. He 
taught one group the strat, 


egy of varying each factor in succession while 
holding other factors constant and found that their success on transfer tasks 
of the same type, while far from perfect, 
control group. The operations the succes 
to Anderson, those that Piaget and Inh 
is true that these were 
and that they had the benefit of individu 
three times a week for 27 s 


Stages presumed to depend on age and cumula- 
tive experience, As to verbal statements of the relations, the results of the 


ere less encouraging; the trained group gave somewhat more invalid 
explanations than the control group. 
Comparable results were obtained by Stern (1967), who was able to 


teach a simple problem-solving Strategy to third-grade children, about nine 


years old, in six lessons, Keislar and Stern (1970) were also successful in 


Ë . . nA y 
tegies, a complex hypothesis-testing strategy 
to children of very high intelligence in 


esults demonstrated that the most superior 
profited more from the complex strategy while 
ted more from the simple strategy. 
S too general—strategies, such as 
you are doing while you are doing 
sychologists seem to be those any 
as we have seen can be x pological Principles, gid a ee “ 
$ with fair success. In SS sa 
: ah ms, as in football, chess, merchandis- 
ae Politics, and admini i y are utilized by experts in these fields 
aries be taug » usually without firm knowledge of the 
Psychological basis for the; i g 


e should make it clear that improvement in 
a F Y several methods. The critical question 1s 
Certain ty. ng is effective but which method or 
€ctive under specified conditions. 
erbal Seneralization is the nature of the prob- 
t ic: i inei 
5 a', numerical, Spatial, behavioral, or other. The principles 
) son i iu 
rage: ne problems are rather easily comprehended and utilized by 
the subject to guide his effort ic 
8 S, and we mio} 
posed of fami] È 


lem, whether y, 
that apply 
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easily grasped and applied. But the essential principle for some problems 
is more subtle, and considerable manipulation of the elements is required be- 
fore it is comprehended—thus the problem may be solved by trial and error 
or by memorizing single moves before any overall principle is grasped, even 
if given by the instructor. This seems to be the case in many mathematics 
problems, and some principles of linguistic usage are notoriously difficult to 
State, even though children follow them quite well. The obvious implica- 
tion is that generalizations about training for problem solving should be 
limited to problems of the type on which the generalizations were found. 

Another condition that limits our generalizations about teaching methods 
is the intellectual level of the students. The more intelligent students will 
do better, of course, but there is more to it than that. For example, in the 
study of matchstick problems mentioned above, Corman gave his subjects: a 
test of general intelligence and found, as expected, that those above average 
did better. But, to be more specific, the results suggested that those above 
average were able to integrate information about the principle and about the 
Strategy, while a little information about the principle seemed to distract those 
below average from doing their best to solve the problem at hand. Likewise, 
although the stages of cognitive development described by Piaget may not be 
immutable, they cannot be ignored in any comparison of training methods. 
Again, in general, statements about training for solving problems should be 
limited to subjects similar to those on whom the results were obtained. 

A comparison between two methods of teaching problem Bere as F 
tween verbal and graphic representations of a principle, ra se a 
1S an equally potent representative of its class. But it has often a i 
that an excellent specimen of one method is pitted against a mediocre 
Specimen of another. The results must therefore be discounted, to a 
known degree. A comparison of methods that are both optimal and both 
feasible would be the most informative. 


Individual Differences in Problem Solving 


ith a complicated problem, the 
if he looks 


are hidden 
but any at- 


Whenever two or more people struggle w n 
observer will notice differences in intellectual proficiency and, 
closely, in intellectual style as well. These individual differences 
when scores are averaged for comparisons between conditions, ; y 
tempt at a thorough understanding must bring them out into the open. 


PROBLEM-SOLVING ABILITIES 
Differential psychology and experimental psychology have bori las 
Sophisticated methodologies and, while there is no real a mi i 
tween them, emphasis on one to the exclusion of the other F o E 
Spite of efforts (Gagné, 1967) to bring the two a pe ste ar 
study of individual differences, the dependent variables of prol g 
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(such as number of problems solved, time to solution, a oye eer 
employed) are analogous to test items, hence the terminology and 
of test construction and analysis are appropriate here. o — 
The first question, of course, is the ubiquitous criterion apesin trees 
problems shall we accept as measures of problem-solving pralidieno ie 
is no standard catalog of problems from which a representative sample n i 
be drawn. Most of the problems selected for the experiments reported ear = 
in this chapter were selected for methodological reasons, that is, igs 
certain experimental manipulations were feasible, discrete moves coulc 
observed, or some other research requirements could be met. Such yon 
may be suitable, secondarily, for disclosing differences between means o 
two sex groups or three or four age groups, but they 
satisfactory scores for individuals, whether perform: 
of success and failure or solution time. It is in the nature of problem solv- 
ing, by definition, that a subject may hit immediately on a successful ap 
proach to one problem and then get stuck on sever. on 
to the next. Because of this large variability, performance on a ag 
problem is not a stable index of a subject's average performance. When 4 
or more problems are attempted, this variability is reduced, proportionately, 
but correlations between scores for 2 problems are generally rather low. 
The sum of the 2 scores corresponds, in psychometric terms, to the score 
on a 2-item test, whereas most reliable tests consist of 20 or more items. 
The variability between problems can be reduced also, at least for group- 


r ~ontrol 
ing problems of the Vygotsky type (see Chapter 3), by a careful contro 
of conditions and opti 


! i a Tuni f 

mal spacing of the clues that aid the elimination Ms 
i : s Y 
Incorrect hypotheses (Brunk, Collister, Swift, and Stayton, 1958). Unde 


. Poe ss . : H BY m- 
optimal conditions of administration, the correlation Cphi) between perfort 
ance on two problems w: 


as .77; under nonoptimal conditions, the correspond: 
ing correlation was .10. 

The obvious way 
number of problems 
thermore, since l 
form 


do not necessarily yield 
i Pig aoe 
ance is measured in term 


‘al unsuccessful approaches 


f aper- 
me problems are corrupted when put into pape 


A + eallectiial 
are not. After all, much of the important intellectu 
one does js Paperwork, 


ewan: OF 

h Seldom does anyone suspend pieces f 

oo . . g i air $ 

i e ceiling and tie them together with the aid of a pair ¢ 

pliers, © 

The critical question concerns the n 
problems, When lab 


. M arious 
ature of the difficulty in the variou 
; oratory problems ce 

tion of set or functi 


` ipula- 
mstructed for experimental manipul 
A „js a 
a TOR 3 onal fixedness are converted to printed format, there is 
anger that i 3 Naa a case 
f th ti m i content will be destroved. An illustration is the Lye 
ot the water-jar Problem for studying Mey “ap devisec 
Sting am a ; Ying set. Frick and Guilford (1957) ai 
a group © water-jar test as a measure of flexibility, but its loading 
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on the factor—measured by the correlation of the test with the factor— 
which they called Adaptive Flexibility was quite low. A high loading would 
further the understanding of both the water-jar test and the Adaptive Flex- 
ibility factor, but a low loading of a modified test cannot be clearly inter- 
preted. 

Even the relatively simple matter of level of difficulty raises questions of 
he time scores for a test of easy ana- 


interpretation. A correlation between t 
even though each test 


grams and one of hard anagrams was rather low 
had satisfactory reliability (Sargent, 1940). 

The ideal criterion of success in solving problems in general could be 
constructed by following the procedures for constructing a test of general 
intelligence. That is, one could begin with a large variety of items that 
clearly fit the definition of a problem, refine the test by eliminating those 
that do not correlate with the rest, and thus maximize the general factor. 
(1934) gives some evidence for generality. 


He gave 60-minute tests of problem solving in eight academic areas: geometry, 
sociology, geography, and history. 


\n early investigation by Billings 


Saag physics, mechanics, economics, es 

Since he wanted to test problem solving separately from a mastery of the 

material involved, he gave his subjects instruction ™ the materials in ad- 
s 


vance of the problem solving itself. The average intercorrelation of the scores 
s g ; tion 
67. The problem-solving criterion scores had 


obtaine. 3 
tained by this technique was - y c 
with general intelligence as measured 


hich co à 3 
igh correlations, as one might expect 


by the A 
: rmy Alpha. 
Actually, we will have to admit that the choice of problems for a study of 


individual differences is a decision made by each investigator in the light of 
his own interests and hypotheses just as it is in libio a a 
Problem solving. The results apply t° sample of pro s ra 


the 
an to problem solving in general. 


a restricted 


SEX DIFFERE 
4 F FERENCES i eftort, are 
various kinds of difficult problems, demanding concentrated ffort, a 


Siven to students in high school or college, the male students aey ma 
more often than the female students (Bedell, 1934; Biling Si ce the 
and Burke, 1967; McNemar, 1955; Staats, 1957; Sweeney, 19 h fie ihe 
girls’ grades and general intelligence are at least as good as the : a om 
resources that they bring to the problems from previous lag ta 
eficient; the search fae i interpretation of this sex difference 

turns to their motivation or attitude toward problem solving: 
hen college students were given a test of masculinity ° 


mad women, solved more dif- 
ade more masculine scores, 


ficult : students who scored high 
re pce apa Ta omg «tion worked longer on 
pievi 


a projective m £ 
i ve measure of ac 1958). Th 

a . The 
difficult problems and got more right French = Thomas, ) 

Dvious n : ase the moti 
ext step would be to try tO increa! 5 : 

Women, and ca ae (Carey, 1958) took the form of a group te 
°F attitudes toward achievement after a problem-solving session. The fac 


£ interests, those 


who 
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that the women did register some improvement on a second session giis 
the men did not supports the hypothesis of a sex difference in motiy ene 
Another attempt (Milton, 1959) took the form of a modification. o pa 
wording of the problems to make them more appropriate to the ferainn $ E 
For example, a problem that began: “Snuffy, the tramp, rolls his ped 
cigarettes from butts . . .” was changed to: “Sally, the cook, cuts coo sies 
from batter . . . [p. 705].” Such changes in the problem materials did reduce 
the sex difference somewhat. 

These inquiries do not settle the question of sex differe:.ces in problem 
solving, which have been reduced but not eliminated; and there may be 
stable sex differences in specific intellectual abilities. But the results to date 
suggest that this question belongs in the area of the social psychology of 
motivation. The methodological implication is that a sex difference may ob- 


scure the results of other variables in problem solving. If a sex difference 
sometimes appears and sometimes does not, 


points to subtle motivational aspects of the 
the nature of the relationship with the exper 


this transitory character also 
experimental situation, such as 
imenter. 


CONTRASTING GROUPS OF GOOD AN 


D POOR PROBLEM SOLVERS 
If we could identify a group of good 


-asting 
l problem solvers and a contrasting 
group of poor ones, it would be enlightening to compare them on other 


variables. Acting on this plan, McNemar (1955) selected four tests, judged 
best for her purpose by several 


soning. These were tests that h 
problem solving . . . and low variances on verbal, numerical, and perceptual 
factors [p. 21].” From a sample of 488 college sophomores who took these 
tests, she selected 73 with high scores and 73 with low scores for comparison 
on other tests, (The significant sex differences necessitated different cutting 
scores.) The important findings were that the good problem solvers were 
superior on word fluency, especially fluency in supplying words to fit a 
meaningful criterion; on induction and deduction; and on overcoming a set 
on the well-known water-jar problems. 
High and low 


otunian (1959), ali an masten, Alko by Vle; anien a ae 
shying cam i en verbal and abstract ee ae 
age about thirteert The Problems to 500 pupils in grades 7 and A ee 
Fa e correlation of the composite score on these te 

ane’ Q was .645, hence s clear that genera] intelligence predicts problem- 
solving success moderately well. But the rediction RE far from complete, 
so the analysis was carri p i 


ing 

Pela ed further. Those subjects whose — 
scores were higher tha i : n 
8 n predicted from IQ were designated as good proble 


solvers and those whose scores were lower as poor problem solvers. This 
procedure identified 117 go 


average TQ. od and 117 poor problem solvers of the same 


psychologists, from previous research on rea- 
. . n 
ad “the type and level of reasoning involved ir 


iti 
1 


seat SOUPS Were compared on a variety of tests of more 
specific intellectual functions, th 4 


i © good problem solvers were significantly 
superior on nearly all tests where quality of response, accuracy, or judgment 
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was required; and, without exception, the more complex the task or the 
more restricted the requirements, the greater their superiority. 

We have noted that large-scale investigations of individual differences 
typically employ test items adapted to the requirements of group testing, but 
Tate, Stanier, and Harootunian gave problems of the type employed in 
controlled laboratory experiments, such as water-jar problems, functional 
fixedness problems, and disk-transfer problems, to random samples of 50 
of the good and 48 of the poor problem solvers—in 3 hours of individual 
experimentation. The good problem solvers were generally superior on these 


also, especially the more complex problems. 

alysis of the large sample of 500 was 
1960). The correlations of predictor 
ing criterion were: reading .732, 


A more conventional correlation an 
also performed (Harootunian and Tate, 
tests with the composite problem-solv 
judgment .707, and problem recognition 624. The results, taken as a whole, 
give some support to the hypothesis of a general problem-solving ability which 
has much in common with scores on a conventional measure of general 
intelligence but which remains when general intelligence is statistically re- 
moved. The tests with the largest independent weights in predicting problem- 
solving scores were tests of judgment and of problem recognition. 


GENERAL INTELLIGENCE, REASONING, AND 


PROBLEM-SOLVING ABILITY m ° R i 
A good argument could be put up for considering general intelligence as 


the ability to solve the general run of life’s problems. Ordinary intelligence 
tests are composed in large part of minor problems to be solved and of abilities, 
Such as attention span, that contribute pervasively to the solution of problems. 
If we take success in life as an index of the ability to solve the general problems 
that emerge, the evidence of the importance of general intelligence is im- 
pressive. Life does not offer everyone a standardized set of problems, to be 
sure, but even so it is undeniable that those who score very low on intelligence 
tests, the mentally retarded, do not solve their problems very well, even if 
born with a silver spoon. Those who score very high run far ahead, in al- 
Most any contest, of the mentally retarded, and even of average citizens. The 
remarkable group of 1,500 boys and girls that Terman followed for several 
decades must be considered good problem solvers by any reasonable standard 
that could be applied (Terman and Oden, 1958). It is true that most of them 
started their careers with middle-class 


advantages, but it is also true that they 
were picked out of the large middle class by scores on the intelligence test. 
to) 
Niele ; ra 
he many limitations and misuses of the gene 


l intelligence test should not 
i 4 iy ch none of its rivals have demon- 
obscure its general predictive validit hick 
Strated (McNemar, 1964). 


The same statements could be 
to general intelligence, such as vere 
easoning items have been included in t 
Start. The fact that reasoning problems 
Problem-solving ability reflects 4 common 


yw 


made for other tests that are closely related 
bal comprehension and reasoning ability. 
ests of general intelligence since the 
appear often among the criteria of 
understanding among psychologists 
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i i er processes. 
ji f the contrast with simpler proce 
of the nature of problem solving and of the eb I eer 
h ing tests correlate with success on various types I 
The fact that reasoning } Pe aa gale 
lems, including those that consist mostly of seeking inform: š 
generality of this ability. 


AN ILLUSTRATIVE FACTOR ANALYSIS EO E 

But we can admit the usefulness of such broad measures as genem = 
telligence and general reasoning and still seek a more detailed sent : 
of the contributions of narrower abilities. It is possible that some interesting 
relations are obscured by the use of such general measures. The procedure 
typically is some kind of factor analysis of the correlations between scores on 
tests designed to measure single functions. ; 

Guilford and his associates have carried out factor analyses of planning, 
evaluation, creative thinking, concept achievement, and reasoning, any ok 
which might be called problem solving. But the advantages and door ae 
of this approach are well illustrated in an analysis by Merrifield, eee 
Christensen, and Frick (1962) explicitly labeled The Role of Intellectua 
Factors in Problem Solving. Instead of studying all problem-solving processes. 
they limited their study to the 


production proc 
pletion format. 


which requires the com 


Three problem-solving tests were 
the initial information and the go 
The instructions were to indicate 
initial to the final events. 
think of three words to cor 
words being given. There 
6 minutes were 


used, all including problems in which 
al or desired situations were clearly stated. 
the Necessary steps in proceeding from the 
In Missing Links, the subject was required a 
mplete a chain of associations, the first and fifth 


ie as ns. and 
were two parts, cach consisting of six items, @ 
allowed. In Predicaments, 
and a solution required, with the 


two or more of four give i 
three items cach, and 10 minutes were allowed. In Transitions, the beginning 
and ending of a very short story were presented, and the subject was asked 
to compose the middle part, making the total as complete and logical as he 
could. There were two parts, of one item cach, and 19 minutes were allowed. 
A large number of other tests of known factor content were also given for 
reference, 


When these 
the reliability co 


s a problem situation was de cre 
ER a 
restriction that only some combination f 
j Th arts, 0 
n objects could be used. There were two parts, 


ests were given to 200 nav 
efficients, cor 
were, respectively, 58, 55 
that total time w 


al cadets and officer candidates. 
mputed by correlating the two parts of each is 
, and .46. These reliabilities are good, considering 
and that the score 
S. Nevertheless, 


as short 
by one or two judge: 
in solution of thes 
analyzed w 


: solutions 

S represent ratings of the ae 
R is individuals 

most of the variance between indiy idu 


o n : i as 
© problems escaped the analysis. The variance that wa 
: as quite well accounted for by combinations of the 16 mee i 
intellectual factors, and thus no new problem-solving factors were identike 
ve n ae eS 2 s quay rere relatec 
For example, scores on Missing Links, with a reliability of .58, were relat 
to the familiar verbal compr i : 


se zp ic 
also called cognition of semanti 
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5, were analyzed into two factors: conceptual foresight (also called cognition 
of semantic implications), with a factor loading of .32, and sensitivity to 
problems Calso called evaluation of semantic implications), with a factor 


loading of 31. 
Factor analyses of this type describe the relationships between problems 
ar as individual differences in successful 


and systems for classifying them so f: 
solution are concerned. The results obtained reflect, of course, the range of 


ability of the subjects, the problems included in the test battery, and the 
type of factor analysis chosen. (There are other types of factor analysis, 
starting from different assumptions, that would disclose a common factor in 
test batteries like the one above, which would then be given a general 
designation, such as General Reasoning). To anyone who understands this 
methodology and the nature of the previously identified reference factors, 
the obtained factor loadings help to describe the abilities required for success 


on the new tests. 

The principal limitatior 
the three problems chose 
Problems could have been chosen, 


a on this illustrative factor analysis is the nature of 
n as a criterion of problem solving. Many other 
including some in the test battery, such 
as verbal analogies and logical reasoning problems. Those chosen for investiga- 
tion slant the results of a factor analysis of individual differences just as 
they do the results of an experiment directed toward general principles of 
problem solving, 
LEARNING R ING AGAIN 

ig PROBLEM, Se of problem solving on previous learning 
can be asked in various ways, and the experimental research a earlier 
Sives some of the various answers. Analogous questions can be as ked of in- 
dividual differences also and such questions were often investigated in the 
1930s and 1940s, whet ‘achievement tests were new and exciting and factor 


(Johnson, 1955). 

roblem solving, a 
a test of arit 
moderate correlations were 


e reading and interpret- 
problem solving, were 
t past learning, the cor- 


lso called arithmetic 


MNalysi s 5 
aly sis was not vet in common Use 
hmetic computation 


Pe scores on a test of arithmetic P 
Sear ee NE correlated with scores on 3 
“Sumed to represent the results of past learning, ™ 
obtained, usually in the -40's When tests that requir 
we dificult paragraphs, called verbal aber 
a related with vocabulary tests assumed a s 
ve students study facts and prin- 


Ane ap 
ther way to approacl ] on these. The scores for recall of 


tiple a 
Ht al T then work problems base problems based on such materal 
aterial just lear f lution O E 
; st learned and for St 24. Billings, 1934) Boys were 
also cop Bede 1934; Billings, = ROYS 
orrelated i soidade range Leesen n -< in agre i 
8H or il in the midd Be eT plem-solving tasks, in agreement with 
erally superior to girls on the pee he recall tasks 
1 _ bi è p e reca a 6 
nore recent evidence noted above, but pae i fe to conclude that the more 
Within any limited f knowledge» it is sa á A 
anv limited area 0! č problem from their previous 


Succes s à o the 

ess : h re A 

ear ssful problem solvers bring mor, between ability test scores and prob- 
ning. The conventional correlation 


w 
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lem-solving success reflect the contributions of aptitude and es 
Task analysis of success on specific problems shows up the A a l i 
more specific subordinate capabilities to success on tasks at a hig her: Jey ov 
the hierarchy. We can say, in terms of sampling, that ordinary a a 
achieve some predictive validity by sampling widely but sparsely. To pred a 
success on a specific set of arithmetic problems, we would have to iy 
the requisite component capabilities more thoroughly. All of this does i: i 
mean, however, that individual differences are completely determined z 
previous learning or that the individual does not learn anything while engage¢ 
in solving the immediate problem. _ 
The relation between learning and problem solving goes both ways. Does 
the sex difference imply that the optim 
be different from the optimal method 
problems are usually small, 
whether the results of factor 
Those individuals who are 
while those high on other 
As Anderson pointed o 
describe the skills involv 
possibility of alternative 


: vould 
al method for teaching women would 

: s za aalge 
for men? Sex differences in solving 


were successful because 
because of high ability 
might not be a good gui 

The other difficulty 
possibility of transform. 
problems by 
data into nur 
numerical pr 


i ‘ ifficult 
know that success in solving difficu 
fixedness list 


l ability. The discussion of functional 
hich organization of a solution out of oF 
problems get difficult the possibility a 
ities must certainly be considered in the 


hard, and training programs must encourage such versatility of approach. 
Since the factor analy: 


sts, especially Guilford and his associates, have con” 
structed many problems to meet the specifications of factor analysis and have 
described their factorial it is possible that controlled laboratory €% 
ould be more generally enlightening we 
e traditional problems. Experimental manipulations 
minantly loaded on one factor might have different 
effects from the same manipulations applied to problems loaded on another. If 
evidence for the efficiency of differential training methods suggested by factor 
loadings could be obtained, it would have practical value na would validate 
the factor analysis. 
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At present, from the results of tests given only once, it appears that 
general intelligence is a rough-and-ready guide to problem-solving ability, but 
that reasoning ability, which can be analyzed out of general intelligence, is 
better. This reasoning ability, as identified by psychological research, does not 
differ greatly from reasoning as understood by educated people in general. 
It appears in tests of arithmetic problem solving, syllogisms, sensitivity to 
problems, estimation and evaluation from incomplete data, logical judgment, 
and the like, and can be further analyzed into narrower components if 
ry. To sum it up for the present, the general reasoning factor found 
by factor analysis “is best defined as the ability to comprehend or to structure 
problems in preparation for solving them [p. 17)” (Kettner, Guilford, and 
Christensen, 1959). However, the importance of individual differences in 
perimental evidence on reasoning justify a separate 


reasoning ability and the ex 
chapter later. 


PROBLEM-SOLVING METHODS AND STYLES 

It is customary to differentiate ability traits, indicated by responses that are 
easily graded as right or wrong, from consistencies in responses that reflect 
individual differences in motivation, style, social habits, work methods, and 
other aspects of personality. This has been a useful distinction, by and large. 
But the difference between ability traits and personality traits is not as 


conspicuous as it once was, and an adequate understanding of the production 
of correct solutions may involve clarifying the role of other response con- 


Sistencies, 
a The methodology of this researc} 
he basic question is one of consist 


h follows that of research on ability traits. 
ency over time and consistency, or gener- 


ality, across problems. Relations with other traits, es ond a | 
traits, have been investigated, usually later, and ed ae aie ee 

Cognitive styles are frequently woven into an me rol ery o ae 
about personality differences. Pulling out single threa Sitor E iy oes 
not display the fabric to advantage, put a few illustrations wi serve as an 


i vant. 
introduction to the current styles that are most rele 


: ouGHT , 
FLEXIBILITY OF ACTION AND Th jem-solving processes described 


Individual diff i of the prob 
ndividual differences in some cted ARRAN The research on set led, 


in the pr ` bai eee ae 
: receding chapter have ae i s in rigidity 
m the i940: and 19508 to studies of individual ME 
erbil sa ek ; À n 
xibility ses “ some p” à 
maple E, i nai a ke at eli of this trait ere ‘pa appeared, 
s. Little evidence for § mili: red with genera reasoning 
owever, and the common variance seemed to be pe a on ae 2 
rather than with a general flexibility oe. l l 
Joh AiR kerman > A " 
en 1955; Levitt and pe ’ «the difficulty of fooling people twice. 
he essential psychometric © ? -oct into adopting a set and then 
be trapped again. It is probably for this 
o 


If the sequence of problems 
shifting set 07 a variety of problems, 


hifting from it, he is not likely t 


reason that a factor of flexibility in 
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independent of general ability or general reasoning, has not yet been demon 
strated. 
Cognitive flexibility is a much more substantial f. 
the influential work of Rokeach €1960) 
on a printed test of dogmatism, de 
correlate with scores on printed t l c aaa 
belief systems, but also with individual differences in certain observ: i 
aspects of problem solving. The open-minded, who get low scores on th 


i 0 ith a new belief system that 
Dogmatism Test, are more capable of dealing with a new belief syst 
is at odds with a previously | 


formation into an integr. 


actor, as demonstrated by 
on open and closed minds. Scores 
vised to compare individuals in thi ; 
tests of attitudes, values, and changes in 


respect ` 


y iina na it 
eld belief system and of synthesizing new i 
ated whole. 


INFORMATION-SEEKIN 

The methods by which different individuals colle 
solution of a problem can be observe 
aspects of this information-secking pr 
ences. We noted earlier that diffe 
observed and that those sub 
Strategies scored higher on te 

The printed test format has also been adapte dfn. 
seeking preferences (Cross and Gaier, 1955), Pulling a tab disclosed either 
a principle or some facts pertinent to a set of balance problems. The tendency 
to seek facts was negatively correlated with the fad 
a preference for the latter was related to mathematical ability and predictec 
achievement in an Air Force mechanics course as well as conventional pre 
dictors (Gaier, 1955), 

Rather large differences can be observed in the 
different People collect to solve 
tion and they jum 
data are all in, 


fig ct information for the 
d without much difficulty, and PAROUS 
cess can he treated as individual higo 
rent troubleshooting strategies could be 
jects who adopted the 


more 
sts of general reasoning, 


sstematic logical 


-iraan 
d to the study of informatio 


‘inciples: 
tendency to seck principl 


‘ i hat 
amount of information t ; 

`; ittle informa- 
a problem. Give some people a little ae ie 
P to a conclusion—intuitive thinkers. Others wait unti 


tes z ; 61) 
In order to study these “intuitive leaps,” Westcott (196 
prepared problems with five clues hidden under tabs 


> clues 
and presented the clu 
by allowing the ct to lift one tab at 


j ; . » follows: 
subje atime. One simple example follo 


4:2, 9:3, 25:5, 100:10, 64:8 Complete the analog 16:__, 
Verbal analogies and series Problems were given to college students, a 
were instructed to work the problems with = few clues as possible. = 
Principal results were the rather hioh reliability coefficients, which mer 
that the demand for a lot of information or a little was a stable aspect of aia 
subject's Problem-solving stvle. The information demand for verbal problems 


i$ a a 1 ic rob 
= moderately Correlated with the corresponding score for numerical p 
ems. For the Stroup asa whole, more problems we 


information Was exposed; but for individuals 
tions between information-demand scores 

More recently, Westcott (1968) has ce 
from pictures t} ery sketchily 


hat begin vi 
for informati 


re solved correctly when oi 
there were negligible correla 
and problem-solving success. E 
mstructed corresponding ‘oar 
and progress toward greater comple 


i : a 
ness, Corresponding scores on demand can be obtained, and thes 


INDIVIDUAL DIFFERENCES IN PROBLEM SOLVING * 223 
too are quite reliable. But the correlation between information-demand scores 
on the two types of problems are negligible, hence it appears that these are 
rather narrow cognitive traits. 

Rokeach’s theory of dogmatism has implications for information seeking, 
which Long and Ziller (1965) tested by similar techniques. On a series of 
Speculative questions the frequency of “Don’t know” replies and dogmatism 
Scores correlated —.32. Delay in making a decision also correlated negatively 
with dogmatism. The open-minded, nondogmatic person is likely to seek 
More information, according to the authors, while the dogmatic person 
protects the information he has in order to maintain his belief system. 

The in-basket technique, which has been used in studies of administrative 
performance, has also been adapted to the study of the information-secking 
styles of teachers in training. Shulman, Loupe, and Piper (1968) placed a 
variety of realistic pieces of information in the in-basket to simulate the prob- 
lems facing an elementary teacher on the first day in a new school and the 
resources available for dealing with them. They were able to identify a 
group of dialectical teachers (in contrast to the didactic) who were more 
Sensitive to problems and spent more time on inquiry, during which they 
juxtaposed different sources of information and shifted their attention among 


them, 


CONCEPTUAL DIFFERENTIATION 


Early studies with free sorting tests and other techniques that allowed the 


Subject to choose his own categories showed that 1 subject ee use 30 
Categories to sort a pile of objects that another would sort into categories. 
I «og were said to have a narrow equivalence 
nose who used many categories were 
€ é i 1962) questioned whether those who 
range for each, Gardner and Schoen ( )q ) i bac 
exhibited a broad equivalence range by this any were e ua y 4 
k i studies 
Ccptualizing all the items included in the large group. I er in writ ea 
On conceptual differentiation they required the subject to de ne : i ; 
f 3 > 
among all items in a group. To some subjects a large op E E ; jects 
§ 5 3 
turned out to be a wastebasket category, while 1 oeae Toa ae fete 
mariti = 
8enuine conceptual differentiation me ampan ae control, that 
ences, This procedure disclosed a fairly sables pen osed to other differ- 
was best measured by conceptual differentiation he oe . 
€ntiations (such as merely listing concrete particula } 


i 5 ility to abstract. 
dependent of general intelligence and ability 


THE REFLECTIVE STYLE k on a problem have been noticed in children 
ariations in rapidity of attac% OP ? in motor performance. 
as well as adults ad A solving piben prepir ae eae 
7 illips 
; agan, Rosman, Day, Albert, een H x Geja] pronpiag pokles 
iati “71 : n a > . 
es among children in grades nhibition of incorrect solutions 


A for i 
to Control over impulsivity which allowed ‘aeons Shien ni 
and reflection over alternative possibilities. 


oO se times on similar tasks given 
Ver t; ] ith respon tm Ò 
asks and consistent also W 


3 s 3 and 4. 
ater when these children were in grade 
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The development of control over impulsivity between “pe Seat ok 
to the production of more concepts of an analytic type, and je 7 D 
could also be produced by instructions to slow down and reflec 
alternative possibilities. This is probably the 
categorization outlined in Chapter 3. The ben 
with the benefits of the stop-and-think strategy 


same as the analytical style of 
efits of reflection are consonant 
for adults. 


COGNITIVE COMPLEXITY 


i i ; s > achievement when 
Individual differences in concept learning measure achievemer 


everyone is trying for the same cognitive achievement. How can we TEA 
individual differences in cognitive structures when different individua s yi f 
different structures? One stylistically prominent attribute of cognitive ae 3 
tures is complexity, which is a Property of the structure rather cies 
content and therefore js applicable to many different structures. When aie 
paring two countries, for example, one person may compare them on 
dimension, standard of living, while another compares them on three sata 
sions, standard of living, birth rate, and area. Number of dimensions is € 


p È RR ies, foods, 
simple measure that has been applied to the cognition of countries, f€ 
persons, and even motherhood, 


A variety of Procedures have been 
reveal the complexity 


procedures are quite Cc 
about an object or to 
gorize them in group 
the subject has used 


uses several dimensions to characterize an 

complex cognition of that object than the lex 
‘ s P plex 

the black-and-white thinker who uses only two extremes. In a comp 


. . . iti X ple 
problem-solving situation, for example, the more cognitively complex peo} 
ask more questions (Karlins and Lamm, 1967), 


invented in order to get people i 
of their cognitions of objects of interest. Some of ee 
omplicated, but in general they ask the subject to weite 
evaluate several on alternative dimensions, or to cate 
s. Then the traits or dimensions of evaluations that 
to differentiate them are counted. The subject pa 
object presumably has a mor 
subject who needs only one ¢ 


oy x BIA „neral 
ed that cognitive complexity is a ger ax 
; anno} 
ints toward several narrow factors. Va 


xtracted. He concluded we 
may consist of several relatively 
pparently, Cognitive complexity is no 
ardner and Schoen, 1962). 


+ 1962) observed that some peop" 
perceptual tasks and thus can K 
> as on the rod-and-frame test and the embedd = 
» unable to analyze the elements, take a ape 
global approach, This di ension—extending from field independence a 
; , however, but ehane 
neral term is analytical n5 
continuum, as Opposed to global field approac 


ities as 


approach, at one end of the 
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Children of ten and twelve who got high scores for field independence did 
particularly well on the problems of the Picture Completion, Block Design, 
and Object Assembly tests of the Wechsler Intelligence Scale for Children. 
Among college students, the analytical field approach was associated with good 
performance on problems of the type devised by Duncker and Maier and 
described in the preceding chapter. High school boys who got high scores 
on the embedded figures test achieved figure concepts much faster than those 
Who got low scores (Davis and Klausmeier, 1970). 


THE ABSTRACT APPROACH 
i Sorting tests have been given to neurops 
instruments for several decades. The patient ma 
on the basis of color, putting the green objects in one pile, 
and so on. Then he is asked to group them in some other way, or the 
arrangements may be such that the first grouping is obviously inadequate. 

hus he has to shift the cognitive side of his set and group according to size, 
shape, function, or some other attribute of the perceived objects. Most patients 
with cerebral lesions do not perform as well as normal subjects on these tests, 
also handicapped (Fey, 1951; Hunt and Cofer, 
results in a fixed concrete approach, or set, 
as the capacity to view the task 
one example of several possible 


ychiatric patients as diagnostic 
y group a variety of objects 
the red in another, 


and many schizophrenics are 
1944). The pathology apparently 
to the test materials, The normal person h 
More abstractly, his first approach being only 
approaches, so that he can shift to another when necessary. . 
Like many other dichotomies between the normal and the abnormal, this 
one can alto be treated as a dimension—extending continuously from the 
abstract to the concrete—to permit more refined measurements. Harvey, Hunt, 
and Schroeder (1961) have gone beyond the clinical interest in abstract 
it as part of a general theory of personality. The 
a more complex, differentiated system of con- 
experience. Thus he is not so prone to 
rete thinker. 
Neomplete sentences provide a method for png a eg GF 
°€ rated on the concrete-abstract continuum by such entea en ul 
°valuativeness, and simplicity. One study (Ware and pi rite ben Ke 
aall number of subjects, whose statements bad er e a y Saan s oe 
Ou 7 ai ion abou > 
apa rapa renee and with P Pren nee 
i iects manifested a greater nee 
abstract” subjects. The more snare sali a aera ong 
Cognitive consistency, presumably pee Aicting dake 
Ordinate concepts to encompass superficially conflicting 


4 erceived blocks and 
he early work abstractness in sorting tests used p 
‘arly work on abstra going by the same name, may have 
, or? 


. : 

ther objects. The more recent concept 89°" han with the early notion of 
More in common with cognitive complexity t ) 

abs i ý 

bstract attitude. 


thinking to incorporate 
abstract thinker is said to have 
epis for categorizing the data of his 
absolute evaluations as the more conc 


styles and problem-soly- 


: itiv 
resting cogn! z 
a 8 ]l analyzed. Traits that 


lta ; int 
ppears, finally, that many yet, not we 


ing methods have been described but, aS 
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7 alyzed into narrower traits, and 
PP eeh amkeni be ir be the same. = Loree 
E at ret 1 ell analyzed. It is possible a 
with ability traits also have not yet been we sal O alll correlate 
oe a ngkoe laal k ieS and the emek 
ra en Te ne by Dunham, Guilford, and Hoepfner (1969). a eee 
vest nat the construction of ability tests has ne Nil ap 
the injection of new ideas, coming perhaps from problem-s g 
and cognitive styles, is needed for further advance. 


Major Trends and Implications 


A ical clarity: 
Problem solving has been distinguished from learning for neopets, 
but experiments on training in problem solving have both iad 
practical implications. Problem solving is seldom taught as an De N 
whole. Typically, the enterprise is conceptually divided into the s + past 
of certain components, such as concepts and principles, that arg T lation 
ents of the solution, and the integration of these into a solution. In site 
to the structure of the solution, the components are substructures; in rele 


Š 
$ "i anized from th 
to the elements of the problem, they are higher units organized 

elements and manipulated as unitary patterns. 


When the components have been 
into a solution is the 
previously learned, the learner's task 
complicated. Analysis of a con 
component capabilities 
hierarchy of knowledge 
ing the components is g 
gies is also helpful. Th 
teaching the 
be achieved 
as well as fr 

Discovery 


achieved in the past, their oe 
principal difficulty, But when they have ve ace 
and the teacher's task are oT ly a 
nplex intellectual performance ang 
appears to be useful to the teacher in clari a 

required and in pl 


OS, each- 
anning training procedures. T 
enerally 


aleine strate- 
helpful, and teaching Lenape a 
3 ifv the necessity 
e evidence at hand does not certify the ne ge 
: capabilites: C2 
components first, because the subordinate capabilitic 


wn 
é h , » top do 
during problem solving. Students can learn from the i 
om the bottom up. 


Procedures are not the 
large amounts of information, but tl 


teaching Principles and str 
discovery 


' f 
most efficient for the emur fot 
hey seem to have special ieee ag 
ategies that apply to problem solving. The con- 
re the student to manipulate the ama that 
cepts and to derive principles and state them explicitly. The i oars ee 
e learned mechanically can be overcome by ee to 
problems that necessitate shifting from one rT 
another, . ithmeti€ 

The use of manually manipulable construction materials in en o 
problem solving often confounds the representation and mtm the 
for discovery of relations. Personal discovery aatica 
l, but so is practice in transforming mgen that 
relations to a spatial mode of representation. Direct comparison indicates 


data with the Opportunity 
relevant relations is usefu 


ug 
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the representation of rules for arithmetic operations in ordinary English has 
the advantage of familiarity; but representation in mathematical notation has 
its own efficiency, not always completely realized by college students. This is 
another area in which training on low-level components facilitates the high- 
level performance. Aside from the advantages of a specific mode of representa- 
one mode to another, while 
preserving the structure, may be expected to be generally facilitative. 

i strategies, such as “Stop and think,” have 
d. But the strategies that have been most 
-solving operations, such as seeking 


tie aerie : r z r 
n, practice in transforming data from 


Instructions about very general 
some value for the unsophisticate 
helpful are directed toward specific problem 
information, stating generalizations, and testing hypotheses. 
of a solving at present is much like prescientific 
successful than others; and much good 
advice can be had for the asking. But the determination of optimal procedures 
for a specific situation requires measures of the critical factors in the situation 
and in the students and a consequent adjustment of the training procedures. 
e problem solving is a complex system of interacting processes, as seems 
likely, success requires at least a minimal level of achievement on the con- 
Stituent processes. The system often includes alternative paths to the solution, 
however, and suboptimal achievement at certain points can be compensated 
by superior achievement at other points or corrected during the gencral effort. 
Iving systems are quite tolerant of deviations from 


i The teaching of problen 
arming, Some teachers are more 


i other words, problem-so 

Optimal procedures. : Hack i 

Measures of individual differences in problem-solving abilities require a 

different methodology from that of problem-solving experiments. There is 
Oo 


no catalog of problems that can be taken as the domain to be sampled. But 
ed of a variety of problems, some evidence 


whe f 
k hen printed tests are construct f E PAF al 
is found for oenerality of problem-solving ability, correlated with genera 
i n ò a P 3 P P s, = 
intelligence and other broad abilities. The problems that appear on printed 
fests are not usually the ones that r in problem-solving experiments, 
rae. ioh on the former also score high on the latter. If 
hig s x 
3 bility, the reasoning factor contributes 
h printed tests yields some- 
rbal comprehension, con- 
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analysis wit 
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hat similar results, identifying such 
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al foresight, ¢ snsitivity to p" x ni oe 

Asi ght, and sen oe of interesting cognitive styles have been 
side from abilities, a number °% ! 


described heorv of personality. From the psycho- 
scribe on inc stion W ) > ) 

Metri i. a ees he not been analyzed as thoroughly 
ric point of view, cognitive ) 


` m are rather narrow 
a8 ceo Mac A most of ther 
T a à ' Ac far as we 
trai RUME abilities. As far as 1 thods, but further work may 
alts, limited to certain materials an¢ 
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si hia more generality- 
caching methods interact wit 
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styles have 
ve know, 
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h developmental level: the optimal method 
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who have not. Some efforts have been made to demonstrate similar inter- 
actions for cognitive abilities and styles. Perhaps those who get high scores 
on adaptive flexibility, or on cognitive complexity, or on some other intellectual 
trait, would learn to solve problems best by one procedure and those who get 
low scores would learn best by some other procedure. The results to date 
have not been very encouraging, but this is a promising design that may 
someday justify such differentiated training procedures. Thus we arrive at 
the not very startling conclusion that the contribution of individual differ- 


ences to problem solving looks much like the contribution, outlined in 
Chapter 3, of individual differences to concept achievement. 
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atements 


| ogic and psychology grew up together, 
| sharing a common interest in rational 
Wea] | is thought; but as they matured, they special- 
iy = | ized and drifted apart. Logic, once con- 
cerned with the laws of thought, became an abstract 
d with formal relations between prop- 
ositions, rather than the thinking of them, and found 
n common with mathematics. Psychology, pushed 
volution, moved in with the biolog- 
hing the study of idealized 
al behavior and treating 
as natural human 


science concerne 


much i 
by the doctrine of € 
ical and social sciences, relinquis 


thought for the stu 
both rational and irr: t 
activities open tO empirical investigation. 

Today, the psychological study of logical thought is 
part of the study of problem solving, but the problems to 
be solved are logic problems or problems in which logical 
relations are assumed to be critical. This variety of prob- 


dy of actu 
ational thought 
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ing has certain distinctive characteristics, as we shall sce. Since the 
pened or eye f suck blems began, around the beginning 
psychological investigation of sash pro Sa e Somes 1938), the term 
of this century (see Burt, 1919; W heceler, 1958; oodwe ; “a mip aerate 
reasoning has therefore been used to distinguish this meng he NE 
problem solving, creative thinking, and other varieties. ui me aT 
was used at one time for the integration of isolated experience by 3 naai 
animals, such as the laboratory rat; but such a special meaning is far 
from the conventional one and need not be considered here. : i 
Some aspects of inductive reasoning were considered im the € bal Tes 
concepts; deductive reasoning is the topic of this chapter. Like other comple? 
activities, the two types of reasoning overlap. The 
may include testing deductions from a hypothesis, 
may include evaluation of the validity of 
statements. But the em 
chapter is quite 


achievement of a concept 
and deductive reasoning 
a rule generalized from a series of 
pirical psychological research summarized in this 
different from that of previous chapters. 


Logic and the Psychology of Reasoning 

A psychologist may need to know 
need to know a little 
tween logic 
defining the 
tion. 


a little logic just as he may, at times, 
chemistry or a little anthropology. The distinction be- 
and psychology must be respected, but logi 


vic is helpful here in 
problems of this chapter and making a start on their investiga- 


THE NATURE OF REASONING 
From the psychological point of view, we can say that reasoning is poly 
solving enlightened by logic. Thus reasoning can occur in the solution = 
many of the problems described in previous chapters, but it can be identifiec 
most unambiguously in the solution of problems of formal logic, such as a 
syllogism. Is the following argument valid? 

Some Y is Z; 

All X is Y. 

Therefore, some X is 7 


To anyone who has recently studied logic this will not be a problem. if 
he has memorized the rul 


l es for evaluating such syllogisms and has practiced 
their application to particular cases, he will be able 1 

the conclusion with no difficulty. This easy task is d 
the logical point of view, that is, i 
premises to a conclusion; but from o 
solving. To anyone wit 


to judge the validity of 
eductive reasoning from 
it consists of an inference from given 

ur point of view it would not be problem 
Š gan: hout logical training, however, this will be a problem, 
and his activities when faced with such a problem conform to our definition 
of problem soly 


o ving. Deductive reasoning, then, is the solution of problems 
about the validity of inferences, This chapter will also consider information- 
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y easy inferences, performed very rapidly, for 


processing tasks involving ver 
bute to the processes involved in genuine 


the understanding they contri 


reasoning problems. 
The distinctive feature of logic problems is that they are based on formal 


propositions. When we speak of “formal logic” or a “logical form,” we are 
referring to the structure or form of an argument rather than the content. 
The syllogism printed above, unlike anagrams, light switch, and two-string 
problems, is not concerned with any particular materials. Replacing X, Y, and 
Z by S, M, and P, or by animals, vegetables, and minerals, would not change 
the logical form of the argument or the validity of the conclusion. The two 
yithout particular content, and the question 


premises are general statements V 
material state of affairs but 


is not whether the conclusion corresponds to any 


whether it is a logical deduction from the premises. 
In this respect, logical reasoning is like mathematical reasoning. We can 


say that 3a x 4a = 12a? without consideration of the referents designated by 
these symbols. The distinctive characteristic of reasoning, both inductive and 
deductive, that gives it its tremendous power and engages every chapter in 
this book consists of letting symbols stand for reality and then manipulating 
them to solve problems ihat cannot be solved by manipulating the concrete 


reality. Just as the use of a symbol to refer to a class of objects is not true 
andard usage, the conclusion to a logical dis- 
a formal sense, as a logical consequence 


or false except in respect to s$ 
cussion is not true or false except in 
of the premises. 

Deductive reasoning may be a part of the solution of many problems, es- 
pecially in science, after the empirical data have been represented symbol- 
ically. In such cases, the success of the whole enterprise depends on 
Appropriate symbolization of the data and later application of the symbolic 
solution to concrete data, as well as on the deductive reasoning per se. Thus, 
in respect to efficiency of performance, formal reasoning has great potential- 

A ductive reasoning 1$ formal, 


iti : 

tles for transfer. But, precisely because de s à 

vant logical patterns are not easily perceived, and 
= o 


ay be difficult. 
y f the general language, in India as in 


arguments (Bochenski, 1961). 
s is a man; therefore, 
anyone with ordinary 
they have elaborated 
] language, rules for 
and a metalanguage 


Similarities between relev 
the transfer to concrete data m 
é Logic grew out of the resource 
Greece, probably as a technique 
he celebrated syllogism, All men @ ad 
Ocrates is mortal, appears in the form of sentences, a 


na / eloped 
ability can read it. But, as logical systems have developed, 
mbols that are not in the genera 


s that are not rules of grammar, an 
bout statements Like many other intellectual accom- 
Ou a = y 

general languag 
Plishments, these logical achievements depend on the ge guage but 


add to it. When someone uses letters to d standard propositions, or 

A : inary 

rearranges the verbal expressions of ee » pei 

j e 

orm, he has explicitly gone beyond har fan 

Citizen, untrained in logic, maY "i fo an inconsistency 

ordi a a . ‘or 2 
tdinary listening or reading tO Joo! 


es O 
for winning 
re mortal; Socrate 


Special concepts and sy’ 
evaluation of argument 
or making statements a 
Jesignate 
language into a standard logical 
ral language. The average 
but when he departs from 
or to examine the 
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‘ niente. ane aoe EAS 
validity of a statement as an inference from other statements, we say he 
ee i lved with the aid 

Many problems that do not look like logic problems are solved w it l - - 
of logical reasoning, but in such cases the contribution of reasonir g o 
obscured. The most clear-cut research on conditions that influence reasoning 

É ee ea 2 
has been done on problems that are frankly logic problems rather than thos 
in which the logical operations are hidden. 


ARE PEOPLE LOGICAL? 


A popular question in the last century, when man's kinship with oher 
species was being both proclaimed and denied, was whether man is a rationa 
animal. Are people logical? In past centuries, when logic was closer to the 
gencral language, it was part of the general culture. An educated man was 
expected to be logical just as he was expected to be able to ride a horse, 
cast up sums, and make a creditable speech. 

This old question can be put in its proper context by asking a parallel 
question: Are people astronomical? When astronomy was part of the general 
culture, an educated man was expected to be able to tell time by the sun. 


plot his course by the stars, and foretell which phase of the moon would be 
right for an amorous adventure. 


Today, when knowledge has diversified into so many 
expect all educated people to be masters of logic, 
specialty. But, avoiding the erudite developments of modern logic, it is a 
fair question to ask how well ordinary adults, such as college students without 
training in logic, can do on the more elementary aspects of logical reasoning: 
It is particularly informative to classify the errors that they make and to try 
to account for them psychologically. f f 

The motivation for reasoni 
a functional view, 


areas, we do not 
r 
or astronomy, or any othe 


ng should be considered briefly. If we take 
logical reasoning is one way, a particularly effective Way» 
of reaching a goal. In addition, logical consistency may be a goal in itself, 
or a snare to be avoided. Some people worship logical consistency, regardles 
of long-term goals, just as others enjoy the absurdities of their actions, ae ‘ 
inconsistency had a value also. The famous inventor, Charles F. Kettering 
once warned his colleagues: “Beware of logic. It is an organized way of going 
wrong with confidence.” A complete treatment of the motivation for thought 
would include esthetic valuations of spontaneity and asymmetry, religious 
preferences for direct, nonlogical grasp of truth, and the appeals of magic? 


symbols and rituals, all of which may, if nothing more, diminish the appe? 
of logical consistency. It 


« a ne 
: is safe to assume, however, that the ambience of ; t 

poy hologieat laboratory and classroom encourages rationality and that mos 
epartures from logical consistency under these conditions can be interprete” 
as errors rather than as the products of an alternative value system. Elsewhere 


the dynamics of nonlogical outcomes may deserve more subtle exploration: 


MATERIALS 
Logic supplies the materials for experiments on reasoning just as chemistty 
Supplies th 


e materials for experiments on smell and the dictionary materials for 
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experiments on anagrams. Logic also supplies a standard terminology for 
referring to these materials. 

Most research has been concerned with propositions, or statements with 
subjects and_ predicates; hence it is convenient for analysis to let S refer 
to the subject of a statement and P to the predicate. Conventionally, the 
four standard propositions are designated by vowels, as follows: 


A. Universal affirmative: All S is P 

E. Universal negative: No S is P 

I. Particular affirmative: Some S is P 

O. Particular negative: Some S is not P. 

These are called categorical propositions because they are statements about 
fers to the relation of inclusion. Thus 


categories or classes, and the copula is re 
ignated by S is included in 


the statement All S is P means that the class des 
the class designated by P 

Simple problems can be constructed from such propositions around the 
operation of conversion. Knowing that All S is P, can we conclude that All 
P is S? This question is put in truefalse format, but the open-end format 
is also used. What can we say about P, given that Some S is P? (The 
direct converse of an E or an I proposition is valid, but the direct converse 
of an A or an O proposition is not.) ; 7 ; 

Conditional reasoning has been investigated with conditional statements of 
the If... then form. If it rains tomorrow, I shall stay home. These are sym- 


bolized If P, then Q, and a variety of problems can be constructed, given that 
, Q, 3 
> 
P is true, or not true, and so 0n- ; : ‘a 
There are problems involved also in reasoning from inequauties, such as 
A is: larger tham Be Lineat syllogisms oF three-term series problems can be 
g e Li yllog 


assembled from two premises: Mary is shorter than Sophia, and Mary is 


taller than Rachel. Who is the tallest? 6 aie Ceta 
The 4 categorical propositions, _ E; 1, Q, ĉan easy 3 Ta IDE Ab A 
premises and A conclusion so as to construct a variety © syllogisms. The 
number of syllogisms thus constructed is not large, hence the universe of 
a” When such syllogisms are used as prob- 


syllogisms c be ex licitly defined. 
yllogisms can be exper’ - 
lems to be solved, this universe of problems ar be sampled more i 
e solved, he complete universe can be studied if 
tematically than most others, and the ¢ P d conditi 
desirable. T} ; beral three-term series problems and con itional reason- 
én able. ne SLE S 1 While the results obtained from a study of 
‘ g R is na sma g or construction problems might not 
a sample of 20 arithmetic, 4 age” a is s 
be Aarle from a study of another sample of 20 such problems, thas See 
Pling qu i tion does not arise in the study of logic problems. And'there is no 
g question a 


there are rules in the logic 


à . {oreovet 
aE of apparatus differen = to all problems. 
ooks for ascertaining the right ans Pai : ‘ 
A he sue e that logic problems enjoy, in comparison with other 
nother advantage D be obtained from analysis of errors. 


problems, is the information tha 


> item writing usually point out that 
A i ons 0! 1 p p 5 
lthough general ea as items can yield useful information about 


sty 38 seldom realized. Logic problems 


wrong opti in multip 
g options 1 : 
this opportu™ 


the causes of the errors, 
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i j k e logical 
written systematically, however, permit the subject to make the same in 
a x p a le 
error on different problems so that error tendencies above neg re 
i 3 e eT 
can be diagnosed. Many of the experiments to be discussed have there 
fe) d 


employed the multiple-choice format and have systematically tabulated the 
different types of errors made. 


METHODS 


Some of the early inquiries had subjects think about the san PN K i 
reasoning problem and report their thoughts. Since the difficulties o os 
method are now recognized, recent research takes advantage of the a 
jective methods of modern experimental psychology. Generally, the ee on 
response, whether valid or invalid, is treated as the consequence of multip e 
antecedent conditions. The aim is to identify these various conditions one 
by one. 7 , 

Certainly, the form of the argument, the combination of propositions, 1$ 
a likely possibility, and this is investigated by varying the logical form 
while holding the content constant. Typically, this is done by using leners 
such as X, Y, and Z, or A, B, and C, as the terms of the propositions. The 
frequency of valid and invalid reasoning for each logical form can thus be 
studied and, more important, the logical forms that lead to invalid reasoning 
can be specified. d 

The content of the argument is another likely influence on reasoning an 
this is studied by varying the content while holding the logical form constant: 
The content may be letters, nonsense terms, such as planti and glubs, oF 
meaningful words, such as angels and zinnias. When the terms are words of 
resemble words, the statements need not be printed formally, as above, but 
can be printed as ordinary sentences embedded in continuous text. From 


ms 2 . . . i i ns 
the standpoint of formal logic, which is independent of content, variatio 
in outcome that are correlated w 


ith variations in content are nonlogical. 
Most questions about reasoni 


ng are not questions about the effects of i 
course in logic. They are stated rather in respect to the ordinary intelligen 
adult, hence subjects with training in logic are usually eliminated. Similarly. 
errors are more psychologically revealing than correct judgments, en 
many Investigations of syllogistic reasoning have been based on indeter 
minate syllogisms, the correct response to which is None of these. Deer 
minate syllogisms are often interspersed, however, to retard the develop- 


g ing 
ment of a response set and also to supply a measure of general reason!Me 
ability. i 


General Task Variables 


ae 
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Wf llogi ri i iliar terms are somewhat 
two (Anderson, 1957). Syllogisms written in familiar = : 
easier than syllogisms of the same form written in letters or un ee oi 

y Hog i ' : : 
(Wilkins, 1928). Abstractness is probably also important, ha a Taan 
propositions are intrinsically abstract, and this circumstance makes all forma 
reasoning difficult. 
d n - , 
Boredom is a general factor in reasoning problems, as m be poba 
s iti igi 4 stain 
Exciting arguments about politics, religion, and sex may Peer a 
but s i , in 
hours, but several investigators have noted that ree ee sre A a 
arguments written in terms of A, B, and C, and repeated in f D, E, 
i i j tho have a special interest in such 
and F, are monotonous. Those subjects who ha p kgs eee 
abstract srials and those who can force themselves to yze eac 
aaien woth ber i . environmental conditions that maintain 
proposition will be more successful; envir 


alertness will also be helpful. 


The Difficulties of Syllogistic Reasoning 


sms are assembled Ir sitio’ P ise: other as 
Syllog d ns as premises and an 
i gis ‘ ssem from two prop sit : 
y + E * sae = ersa ive ropositions would form the follow 
a conclusion. Three univ rsal affirmative p P : . 

eism, lV g s! arv symbo for the m ddle term: 
ing syllogis n, M bein the customary s) 


All S are M; 
All M are P. 
Therefore, all S are P. 


These syllogisms, taken as they me be asked to evaluate each conclusion 

fre False format; the subject ern ye — of the truefalse format is 
; pene inci isadvantag 
as valid or invalid. The principa i e manufactured 
“4 =a poss oe E response set When rr see i 
possible operation of a res "ations of propositions, many 
k ai combinations > han half 

systematically, from the various à invalid to more than ha 
are weri h valid. Thus anyone who Sn pit E aaan 
the ‘éhlems aiher because he has a genera ne of logical tricksters, will 

E S, C E logic has made him war) g 
or because a smattering OF S 


iti increases 
' orimental condition that incr 
i Likewise, any experimen f right answers 
get more right answers. Likewise TY i proportion of right answers. i 
undifferentiated caution will incre aE Dio agian oe oe oe 


: be P i 
Alternatively, syllogisms ©” E, I, and O propositions as possible 
multiple-choice format by writing that follow, then select the logical con- 
nises tha 
conclusions. Read the two pre™ 
clusion: 


Some S is M; 

All M is P. 
Therefore, 

1. All S is P 

2. All S is not P 
3. Some S is P 
4 
5 


can be considered problems in the 


. Some S is not P 


ý z valid. 
. None of these conclusions is va 
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In comparison with the true-false format, the multiple-choice = Ei 
like a giveaway since all conclusions, valid and invalid, are epey a i 
But these problems are dificult for educated adults without forma taining 
in logic; the number of errors is large. And it is in fact the ipai oa 
analysis of these errors—that is, the invalid conclusions that are accepted as 
valid—that constitutes the principal advantage of this format. The aim is 


to tabulate and describe such errors and to explain, in psychological terms, 
why they occur. 


EFFECTS OF CONTENT 


In addition to general task variables that make some arguments hard to 
follow and thus increase errors in judging the soundness of the conclusions, 
there are more specific and powerful effects that arise out of the content of 
the argument. As we noted in Chapter 2, the introduction of thematic con 
tent into concept formation experiments disrupts the logical information- 


processing strategies observed with meaningless geometric figures. Although 
this lead has not been followed up in inductiy 


deductive reasoning have been studied in rel 
clusion and cognitive inconsistency. 


e reasoning, content effects in 
ation to attitude toward the con- 


ATTITUDE TOWARD THE CONCLUSION 
When the course through an argument to its conclusion is hard to follow, 
the reader or listener is tempted to rely on his attitude toward the conclusion 
itself rather than evaluate the v 


alidity of the whole argument. In order to 
demonstrate this effect, Janis and Frick (1943) resorted to syllogisms with 
familiar terms printed in short paragraphs like ordinary text. The instruc- 
tions to the subjects, particul 


a A a o first 
arly important in these experiments, and the firs 
two of their syllogisms follow. 


Instructions: This is a test in reasoning. You are to check each of the following 
arguments as “sound” or “unsound.” A sound argument is one in which the con 
clusion follows logically from the premises. (Note: “All” in logic means gacli 
and every case. “Some” means at least one and perhaps all cases.) 5 
No Bolsheviks are idealists and all Bolsheviks are Russian. Therefore some 
Ussians are not idealists, 

The Eskimos are the only people w 
all Eskimos h 


ave good teeth, 
meat have bad teeth [p. 74]. 


i a at 
ho eat nothing but meat and it is found mi 
So we may conclude that no people who eat 0m) 


pe judging 16 of ach subject reread the ee 
m J ; : e 
ae ‘was asked agreed or disagreed with each one. Thus a 
h gments of the conclusions could be sorted into four types: agree-valid, 
-invalj : i * ) 2 nate bias 
en agree-invalid, disagree-valid. The effect of the subject's bia 
wo Jenis A : 5 od 
i it ‘ yer oe him judge the conclusion valid when he agree 
with it and invalid when he dis ah : he pro 
disag 7 z S ‘rease the 
penton of Silemews - agreed with it, and thus to increase a 
Sal h 1 S in the first two of the above types and decrease 
opor a ; n 
proportion in the last two, Even though the subjects, who were graduate stu 


these syllogisms, e 
whether he 
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dents with no formal training in logical analysis, had only two alternatives to 
choose from, they made errors on 23 percent of the judgments and, as ex- 
pected, these errors were preponderantly of the agree-valid and disagree- 
invalid types. With subjects of less education and interest in abstract things, 
the difference would probably be even greater. 

One interpretation of this experiment is that the emotional content of the 
syllogism interferes with logical evaluation, so Lefford (1946) deliberately 
manipulated this feature. He asked several groups of college students to judge 


the validity of syllogisms, instructing them “to judge whether the conclusions 


arrived at in these passages are warranted by the statements given in support 
th controversial 


of them [p. 129].” Half the syllogisms were concerned wi 
issues, as in the two examples below: 


War times are prosperous times, and prosperity is highly desirable, therefore, 
wars are much to be desired. 
Se marh. © be daa geas AI GIO leadis pave#s 
all CIO leaders are agents for communism LP- 1]. 

s of the validity of the conclusions 


ificult, even for college students, 


dical ideas, therefore, 


The instructions called for judgment 
from the premises. Such judgments are 
and the scores were quite low- 

The other half of the syllogisms were matc 
Was noncontroversial: 


hed for form, but the content 


and all human beings are fallible; therefore, phil- 


h live in water too; therefore, all fish must 


Philosophers are all human, 
Pie are fallible too. 
whales live in water, and all fis 


be whales [p. 130]. 


T d approximately the 
hes - form as the others ane ap 
sm same 
e syllogisms, of the pp uch more often by college students. 


same ] fr , 
ength, were evaluate! eonig compounded by the intrusion 
a d . 
h makes it even more difficult for the un- 
trained person to play the sy e straight and evaluate the argument 
play y 1108) 
19) 
n purely formal grounds. fee 
The terms used in these early Hes jat some wer 
ts ye the experimenters as n saben d with unfamiliarity, in- 
onal, and i have be : A 
‘actors ma 5 
ò , and the like. remise on a scale of truth- 
this lioht dent: 
ig . uden Gi a 
Rite a a ore mene sotional premises were rated significantly aea 
, and found that the € mises. Hence, the 
to the false end of the scale than must have been confounded 
effect that Lefford attributed to emotl bs discussed below. 
with th Sear Ý i y 
e effect of cognitiv > ‘ith attempts to show how 
This approach to “deductive reasoning ia tie and sea oh experiments, 
Wishes “distort” beli tions “bias M% g ws 
istort” beliefs or €™° ucated people and citizens 
as well + constr IIs, have show? that ed peop : 
as public opinion P% ® y recent approach admits that 


at t The more 
large are susceptible t 


not carefully standardized. We 
e controversial or emo- 


o this effec! 
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both personal desires and logical reason influence acceptance ofa evsyeliaion 
and studies the interaction of these two influences. An example of the ai 
inclusive approach, the “logic model” of McGuire (1960) moved away sram 
the classical two-valued logic, which considers onh propositions that ae 
either true or false, to study ratings of the probability that a proposition is 
true. McGuire began by asking people to estimate the pr babilities of prem 
ises and conclusions, and also to rate the desirabilities of cach. Good reason- 
ing, according to this model, requires that the probability of a conclusion be 
quantitatively related to the combination of the probabilities of the premises. 
Research with high-school and college students disclosed a moderate relation; 
even on controversial issues these people were somewhat logical. A moderate 
relation was also found between acceptance of conclusions and ratings of de- 
sirability of premises, which can be interpreted either as the influence of 
desires on beliefs (wishful thinking) or of beliefs on desires. 


A more subtle logical influence was demonstrated by McGuire when new 
information raising the probabili 


es of minor premises resulted in higher 
estimates of probabilities of logically related but not explicitly mentioned 
conclusions. Then a test of McGuire's logic model with more advanced col- 
lege students (Dillehay, Insko, and Smith, 1966) confirmed the usefulness of 
this approach in general but found somewhat less w ishful thinking. Logical 


consistency was demonstrated in that, following changes in beliefs due to 4 
persuasive communication, si 


gnificant changes occurred in logically related 
but unmentioned beliefs. 


But consistency, like other abstract criteria, means different things to dif- 
ferent people. College students today 


x synings 
do not know the technical meaning 
of such logical terms as contradiction 


A airs 
and contrary, and when evaluating m 
of statements on a single topic, they often justify apparent inconsistencies PY 
mentioning the virtue of being democratic and open-minded CLuchins a! 
Luchins, 1965), 
A glance back at trends of rese: 
clusions—only illustrated here by 


historical twist. The early research was designed to show how emotions, att! 
tudes, and prejudices can bias the acceptance of a conclusion to a logic’ 
discussion. Then more quantitative studies show that both the logical struc 
ture of the discussion and biases toward the content of the conclusion must 
be considered. In fact, it is those who score high on an ordinary test of re% 


st . z co . . p i 
oning ability, with noncontroversial conclusions, who are most able to a 
count their own biases w 


à % >Ja i -lusion 
hen judging the validity of controversial conclusio 
Feather, 196 1 s 
CFeather, 4). Recent 
theories of attitude 


és sii on- 
arch on attitudes and acceptance of ¢ : 

: "i Swa CUriOUS 
a few examples—brings into view a cut 


consiste iti n , 
: "CY or cognitive congruity. No one claims that beliefs are accepte 
rejected, and modified by rat; f man 
which Ropes f y rational processes alone, but any model o! 
$ ) 

Pes tor completeness must take these processes into account. 
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COGNITIVE INCONSISTENCY 
Regardless of bias, the terms of a proposition may be consistent or incon- 


sistent. We do not call a person biased if he likes the good and dislikes the 
evil, but the proposition All good things are fine may be easier to deal with 
than the formally equivalent All evil things are fine. Frase (1966a) ap- 
proached this question by having college students rate a number of adjectives 
on the evaluative scales of the Semantic Differential. With the aid of these 
ratings, he assembled propositions in which subject and predicate were com- 
patible in that the difference between evaluative ratings was small. A syl- 


logism made up of three such propositions follows: 


All sinful things are cruel, 
and all disgusting things are sinful. 
Therefore, all disgusting things are cruel. 


Equally compatible negative propositions were constructed from two terms 
with large differences. Moderately and highly incompatible syllogisms were 
also constructed in this way; an illustration of the latter follows: 

No natural things are creative, 
and no natural things are successful. 
Therefore, no successful things are creative. 


This semantic inconsistency did increase time and errors in judging the 


validity of conclusions to syllogisms. 
i ement in that the terms were stand- 


The Frase procedure was an improv 
a statement constructed from 


ardized by students’ ratings, but the effect of 
a combination of subject and predicate terms may not be accurately mea- 


sured by combining ratings of the component terms. In fact, a later study 
CFrase, 1966b) found only moderate correlations between the incompatibility 
index and belief in a proposition. The question of additivity of judgments 
of terms will be taken up more systematically in Chapter 8. 

Turning to a more direct procedure, Parrott (1967) had college students 
rate the truth of complete statements of varying content on a scale of 1 to 7. 
Ile then assembled these in 3 tests of 15 syllogisms each, equated for logical 
Structure, 1 composed of true premises Cratings of 1 to 3), 1 of false premises 
Cratings of 5 to 7), and 1 of mixed premises. Each syllogism was followed by 
5 statements, one of which was @ valid conclusion to the syllogism. The re- 
sults showed that, in relation te gyllogisms with true or mixed premises, 
those with false premises required more time and were evaluated more ac- 
curately, Furthermore, subjects given syllogisms with false premises first, 

: reduced solution time on the second test; but subjects 
he reverse order did not show this re- 
at the cognitive inconsistency of 


then true premises, 
given the two types of syllogis 
duction. One plausible interpret: 
the false premises operated as a c 
logical problem. 

To test the cognitive 


ms in t 
ation was th 
hallenge and a reminder of the formal 


inconsistency interpretation, and others, Parrott 
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(1969) analyzed reasoning time in more detail and included — i 
content-free syllogisms, written in terms of letters such as X, Y, and Z. / s an 
additional control, some subjects were required merely to read the + ae 
and, as Table 6.1 shows, the time spent reading two premises and five alter- 
native conclusions was about the same whether the premises were true “a 
false. But reasoning, directed toward selection of the valid conclusion, st 
more time when the premises were false. In comparison to syllogisms with 
true premises, those written in symbolic terms required much less time s 
reading but only slightly less time for reasoning. Exposure of premises and 
conclusions was under the subject's control, so it was possible to record that 
syllogisms with false premises necessitated much more switching back from 
conclusions to premises than the other two ty 

It is apparent from these results that reas 
that the thinker processes the information so 
the information processing is influenced by 
transfer experiments were arranged, by 


with one type of content to syllogisms of another type, some positive and 
some negative transfer effects were obtained, but the hypothesis that the 
cognitive inconsistency of the false premise 


reasoning was not supported. The clearest 
with the symbolic syllogisms, wh 


pes. : 

oning is more than just reading: 
mehow after he reads it, and that 
the nature of the content. When 
shifting the subjects from syllogisms 


s would improve later syllogistic 
2 ing 
transfer effect followed reasoning 


ich Parrott regarded as evidence for a struc- 
tural insight interpretation. That is, the content-free syllogisms gave the sub- 


jects the best Opportunity to master the syllogistic structures and apply them 
quickly to other syllogistic problems. 


PSYCHOLOGICAL ANA 
Wallonians dance the 
Therefore, it is obvious 
Some Republicans have 
publicans. Hence w 
Logic books ide many types of logical error, and 
deb; e other side's arguments. Among 
oe above, which is the fallacy of the undis- 
tributed middle term, and the second, which bases a conclusion on te 
quency of logical errors in the syllogistic 
of course, a question for quantitative Te 


LYSIS OF LOGICAL ERRORS 

polka. My worthy opponent dances the polka. 

that my worthy opponent is a Wallonian. 
inherited oil wells. Some Wallonians are Re- 

e know that some Wallonians have inherited oil wells- 


TABL 
E 6.1. READING AND REASONING TIMES GN seconps) 
FOR THREE TYPES OF SYLLOGISMS 


Task Symbolic True 


False 
Reasoning 21.9 24.2 29.4 
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search; we know that college students are quite susceptible to the fallacy of 
the undistributed middle term and that they often draw invalid conclusions 
from two particular premises (Wilkins, 1928). Such errors are of genera! 
extralaboratory concern to all educated people. The analysis of the errors is 
complicated, as in other kinds of problem solving, because different errors 
occur together, but several years of research have achieved some degree of 


clarification. 
THE ATMOSPHERE EFFECT 


All S is M; 
All P is M. 


Therefore, all S is P. 

Labeling this the fallacy of the undistributed middle is not an explanation 
of why intelligent adults are so susceptible to it, but a psychological hypoth- 
esis has been offered by Woodworth and Sells (1935) under the name of 

4 does not know the rules and 


atmosphere effect. The untrained reader who 
cannot follow through the abstract implications seizes on nonlogical verbal 


cues. The word All in the premises suggests a universal afirmative atmos- 
phere, so when the reader sees the same word in the conclusion, he accepts 
it. Similarly, the word Some in particular premises wafts a favorable atmos- 
phere around a Some conclusion, making it easy to accept the conclusion 
that Some Wallonians have inherited oil wells. In general, AA premises 
create a favorable atmosphere for acceptance of an A conclusion, EE premises 
an E conclusion, H premises an | conclusion, and OO premses an O con- 
clusion, whether logically valid or not. To take care of mixed premise com- 
binations, Woodworth and Sells added two supplementary hypotheses: 
C1) A negative premise creates atmosphere, even when the other 
scant Oe ait p a particular premise creates a particular atmos- 
premise is affirmative; (2) a particular Pre’ ‘ 
phere, even when the other premise is universal. When Sells (1936) pre- 
n in terms 0 x, Y, and Z to college students and 
ed with the predictions fairly well. 
familiar words as well as letters— 
ative conclusions—also 
effect (Morgan and 


a negative 


sented syllogisms writte 
tabulated their errors, t 
Similar research using SY 
with the valid conclusion to be 
confirmed most of the predic 


Morton, 1944). 


he results ag) 
Jlogisms written in 
he chosen from five altern 
tions from atmosphere 


INVALID CONVERSION ing is Y 
The error of invalid conversion js a common one. ine pe pact A isi Y 
iE te a3 hat All Y is Xx. Knowing that Some is not Y, it is easy 
ae T X. Bot _nferences are illogical, yet ordi- 

x pr oblems adequately succumb frequently to 


to say that Some Y is not ¢ 
nar P ir daily 
y adults who solve their daily French adults were asked to convert the 
valid conversion, such as 


this fallacy. For example; ele 

st 3 ional, only 

a All men are ang 3 , but many wrote statements like 
e class of rational beings (Oléron, 1964). Woodworth and Sells 


Rati ig of man 
tionality is the property a Palid conversion as a special case of atmos- 


(1935) considered the erro" 
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phere = Inferring from All X is Y that All Y is X was attributed to the 
ere. 
oo of the invalid conversion arises partly from the —— 
between the reading of conventional sentences and formal categorica pr 1 ae 
tions. To say that All right angles are 90° angles is equivalent to saying t i 
All 90° angles are right angles. But this is not exactly what the erh 5 - 
categorical proposition means. When the statement All S is P is rewritten ; j 
S C P and read as S is included in the class P, invalid conversions can be 
expected less frequently. . 7 
This insidious error of invalid conversion, whatever its basic origin, has 
been invoked by Chapman and Chapman (1959) a : 
tion for errors in syllogistic reasoning previously explained by the we 
phere effect. Acceptance of the invalid conclusion to the syllogism pane 
above, for example, might be due to the atmosphere generated by the ae 
tion of All, but it might also be due to conversion of the second premise. 
All P is M is converted to All M is P, the conclusion would follow logically. 
Conversion of E and I Propositions is valid, but conversion of A and 
Propositions could explain the acceptance of many invalid conclusions. Fe 
man and Chapman tried out this hypothesis by giving syllogisms in multip'e 
choice form to college students and found that they could thus account we 
large proportion of the errors. While the atmosphere effect is completely 


á 5 r n z ai hat 
nonlogical, invalid conversion seems to be an attempt at logical reasoning t 
goes awry. 


an alternative explana- 


DIFFERENTIAL ANALYSIS AND TRAINING snd 
Three independent investigations of syllogistic reasoning (Chapman tk 
Chapman, 1959; Morgan and Morton, 1944; Sells, 1936) agreed fairly H f 
in identifying the invalid conclusions that were most frequently accepted by 


3 x the 
educated adults. The explanations of these errors were different, but 
procedure of tabulati 


hg errors does not by itself furnish the data for a gho 
between the alternative explanations. One possibility for separating the oe 
explanations would be to use separate training procedures for reducing erro" 
designed around each explanation, | 
To explore this Possibility, Simpson and Johnson (1966) first cane 
two separate scales of syllogism problems written with letters, (such as ®? 
Y, and Z) in multiple-choice form, one for testing atmosphere errors only ao 
one for testing conversion errors only. The atmosphere effect was simplifie s 
and restated as follows: “ 3 es of a syllogism contain the same 
qualifier, are or are not e allor some, many subjects W! 
ch also contains the common term [p. 197].” A i 
i is was fairly adequate in that a 
70 percent of the errors were the predicted atmosphere errors. Another sca’ 
made of AO syllogisms 
that about 60 percent 
Reliability 


of the errors 
> je 
coefficients were .64 for the atmosphere scale and .80 for the c° 
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version scale. Justification for treating the two errors separately came from 
the correlation between scores on the two scales, which was only =I. 

Brief training was arranged for different groups of college students: training 
specifically directed against the atmosphere error, training specifically directed 
against the error of invalid conversion, and general training as a control. The 
results showed that atmosphere errors were significantly reduced by anti- 
rsion training. The results for con- 
version errors were not so specific. These errors were correlated with errors 
in gencral reasoning, and they were reduced by general training as well as by 
s experiment demonstrated that both errors 
lone would account for all 


atmosphere training but not by anticonve 


anticonversion training. Thus, thi 
could be identified and that neither hypothesis a 
the results. 

Since atmosphere and conve 
tion of syllogisms written in t 


rsion errors occur so frequently in the evalua- 
erms of letters, it is an interesting question 
whether these errors can be identified when the same syllogisms are written 
To approach this, Stratton (1967) wrote syl- 
h novel terms similar to those a student might 
nosphere and conversion scales were built in, 
syllogisms. Two examples follow, acceptance 
j and acceptance of No. 4 


as ordinary continuous text. 
logisms in paragraph form witl 
find in a science textbook. Atn 
as well as a scale of determinate 


F a z : g 
of No. 3 in the first indicating an 
] conversion. 


atmosphere error, 


in the second an error of invalid 
natural petroleum resources become more depleted, 
the securing of petroleum from unconventional sources becomes more imperative. 
O § or p ds of northern Alberta Canada. Since 

ne such source is the Athabasca tar sands 0} 7 hong a. ` : 
some tar sands are sources of refinable hydrocarbons, m leposits a wori y 
of commercial investigation. Some kerogen deposits are also sources of refinable 


As technology advances and 


hydrocarbons, 
herefore: 
All kerogen deposits are tar sands. 
tar sands. 


No kerogen deposits are 
Some kerogen deposits are tar sands. 
Some kerogen deposits are not tar sands. 
None of the above [p- > 


vbw -AF 


ly in cool weather, are all 


which open onl: 0 
d only in damp areas, are 


The . “i 3 N ina 
elic as rgentina, ; 
: delicate Glorias of ATS: Fragilas, foun 


Sassoids, Some of the equally delicate 
not Glorias. 

hat can you infer from these statements? 
All Fragilas are Sassoids. 

< No Fragilas are Sassoids. 

- Some Fragilas are Sassoids- 

- Some Fragilas are not Sassoids- 


None of the above [P- J. i 
nd conversion errors were fairly adequate; 
a 


uA wN e= st 
3 is 


pee 1.73 respectively- Most of the errors were 
5 and -/- 7 
ï each scale. 


These scales for atmos 
Telj eRe x a 
Cliability coefficients were - 
t is 

le predicted ones, 62 percent © 
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Such paragraphs, with new terms and irrelevant information, appear w z 
harder to follow than syllogisms of the same logical form written in eten 
terms, and more errors might be expected. On the other hand, they ii 
more interesting, more challenging; verifying the truth of the conclusion 
seems to be more worthwhile. Actually, the errors were similar, both in 
frequency and in kind. On paragraphs that included an atmosphere oa 
about 75 percent of the responses were errors, mostly atmosphere errors. z 
paragraphs that included a conversion error about 90 percent of the response 
were errors, mostly conversion errors. We should not jump to the conclusion 
that these college students were especially irrational because only 25 percent 
of their judgments of the determinate arguments were errors. The i te 
frequencies were no higher, perhaps a little lower, than those reported by 
Simpson and Johnson for syllogisms of the 
content. But, unlike the former study w 
between the two kinds of error, this stu 
training intended to reduce the tw 
gible improvement. 


same form but less meaningful 
hich reported a correlation of mbd 
dy found a correlation of .38. Bricf 
0 errors differentially produced only negli- 


A STEP TOWARD RECONCILIATION 
Recently, Begg and Denny 
posing hypotheses by rest 
two supplementary 
inclusive principles 
vious statements. 


(1969) attempted a reconciliation of the pa 
ating the atmophere hypothesis—but retaining ws 
Principles of Woodworth and Sells—in the form of ne 
which summarize the errors more completely than pe 
Their first principle states that whenever at least om 
premise is negative, the most frequently accepted conclusion will be negatives 
when neither premise is negative, the conclusion will be affirmative. Their 
second principle states th 
most frequently accepted 
is particular, the conclusi 

The frequencies of er 
syllogisms—which diffe 


at whenever at least one premise is particular, ae 
conclusion will be particular; when neither premise 
on will be universal. sical 
rors made by 33 college students on the 14 grimca 
red slightly from the frequencies reported by others- 7 
most frequent error was the one predicted by iie 
principles evidently describe the dara: d 
] 2> Ciminating the need for the hypothesis of imwa é 
Son . is i tion by envelopment, but the authors T 
i he agreement of the principles with the error ea 
quencies does not Support any Specific hypothesis about reasoning procesie” 
half-dozen studies are brought together, it appears 
that atmospher d conversion errors can be separated if necessary» 
pothesis can be stated to include the conversion vei 
nversion hypothesis could be similarly expanded to include a 

atmosphere errors, but this would be more difficult. Yet it is still true thah 
errors of illogica] conversion are common in the evaluation of single propos 
tions. Does this mean that when Propositions are combined as syllogisms, me 
tendency toward acceptance of an invalid conversion evaporates? Or do 
it mean that the invalid conversion is basically an atmosphere effect, 


es 
as 
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Woodworth and Sells suggested long ago? The fact that brief anticonversion 
training by Simpson and Johnson (1966) failed to reduce conversion errors 
offers some indirect support for the atmosphere hypothesis, but more direct 
evidence would be desirable. 

When propositions are combined to form syllogisms, the order of the 
propositions might be a factor in judgment of the conclusions. Premises in 
the AO order, for example, might not have the same psychological effect as 
in the OA order. Although this order effect has not been 
ata from the investigations mentioned 
of the terms of the proposi- 


the same premi 
seriously investigated by itself, the d 
above suggest that it is negligible. The sequence 


tion is another matter. 


THE SEQUENCE OF TERMS 

The terms of a syllogism can be arranged in four sequences, known as 
figures, which are logically equivalent but may vary in difficulty. The four 
figures of a ‘Mogism—all invalid—constructed from particular affirmative, 


or l, propositions, are as follows: 


First Some X are Y. 

Some Z are X. 
Therefore, some Z are Y. 

Second Some Y are X. 

Some Z are X. 
Therefore, some Z are Y. 

Third Some X are Y. 

Some X are Z. 
Therefore, some Z are Y. 

Fourth Some Y are X. 

Some X are Z. 
Therefore, some Z are Y. 

According to Frase (1968), these figures correspond to the mediational 
paradigms of paired-associate ‘learning which are known to vary 1n difficulty. 
A syllogism of the first figure, for example, is analogous to a eam = 
task in which the subject is required to associate X-Y, then Z-X, and finally, 
in the test condition, to learn the association Z-Y. Thus, this syllogism ied 
Ponds to a forward chain of associations, ZXY, which should be ane 
easy to evaluate. Syllogisms of the fourth figure correspond to a bac e 
chain, which should be difficult. Experiments requiring judgments of t : 
validity of 11 syllogisms, written in X, Y, and Z and arranged in oe 

gures, revealed snail differences in time and errors that gave some bor er- 


line S 
support to the hypothesis. o , 
- for the mediated association interpretation came 


and third figures by Pezzoli and Frase 
which in the above illustration is 
alent syllogism in that 


t ore convincing support 
rom a comparison of the second 
(1968). If we think of the middle term, i 

+ as a mediator, the second figure is a stimulus-equiv: > 3 
both subject and predicate of the conclusion precede the mediator in the 
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7. 


i fe oia seca spe in that 
premises, thus: po The third figure is a response-equivalent ty] 


! sO. See he 
niediator precedes both subject and predicate terms, thus: XC, « Sine 


i adi r, it ci roduce 
third figure requires different responses to the same mediator, it can pr a 
z i i i g is younesis, 
interference and hence more errors in reasoning. To test this hyy 
syllogisms were written in ordinary English, such as: 


No grass is red. 
All grass are plants. 
Some plants are red. 


As predicted, college students made significantly more errors in judge 
the validity of syllogisms in the third figure than the second. Even gine 
convincing was the interaction with associative strength. While there ae 
no difference between second and third figures for syllogisms written in tors 
of nonsense syllables or in English words of low associative values, there 
was a large difference between the 
in which the second term w. y associated with the first. of 

This effect of the sequence of terms is stated in respect to frequency i 
total errors, rather than frequency of specific errors, as in the research f 
atmosphere and conversion effects: but predictions about the acceptance O 
specific conclusions 


w te wwe hare 
can probably be stated and tested. As it is, we 
another factor that influences the difficulty of 
with closely associated terms, 


i p əmises 
two figures for svllogisms with pren 


as close 


a š senis written 
reasoning for syllogisms w 


The Solution of Three 


Three-term series proble 
in intelligence tests for h ad 
problems has been direct sses by which they are solved. 
The terms are usu John and Mary, and the relation z 
tween them is an į V, such as er than or happier than, or some ot ia 
asymmetric relati as below or in froni of. When the 3 terms are N 
» as a problem, with a conclusion jed, 
ms, or, if negative premises are incluc a 
: 2). Comparisons of time and errors when terr T 
relations, and sequen are systematically varied have disclosed ar 
vhich ingenious hypotheses about <a 
and psycholinguistics have been elaborate 


-Term Series Problems 


A 4 sen included 
ms, also called linear syllogisms, have been incl 
alf a ce 


ment of premis 


es, spatial imagery, 
Consider the £ ee 


ollowing: 
John is taller than Mary; 
Mary is taller than David. 
Is John taller than David? 
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Many people solving such problems can be observed drawing little charts 
on paper, as at the right, or making gestures with their hands 
to indicate relative heights. Many report that they utilize some John 
imaginary spatial representation, resembling the chart. Thus, 
problems stated in a relation that is easily visualized, such as Mary 
taller than, were easier for English schoolchildren at age eleven | 
than problems stated in warmer than or happier than. This differ- David 
ence in difficulty was not found at age sixteen when symbolic 
skills have presumably replaced visualization (Hunter, 1957). 

Now consider another example: 


John is taller than Mar 
David is shorter than Mary. 
Unlike the first problem, which is so straightforward that the thinker can 
almost read the conclusion directly from the premises, this one requires con- 
Version of the second premise before the logical implications are evident. 


Here is a third example: 


Mary is shorter than John; 

David is shorter than Mary. 

not conversion of terms, but reordering of prem- 
ise in front of the first. According to Hunter 


(1957), the problems that require conversion take longer than the straight- 
Orward ones, and problems that require reordering of premises take even 


longer. Problems that require both are the most difficult. 


This one seems to require, 
ises, placing the second prem 


SPATIAL PARALOGIC 
If the solution of such problems involves representing abstract relations 
Y a spatial ordering, more specific predictions of the relative difficulty of 
i A De Soto, London, and Handel (1965) 


te Problems should Be penible and compared problems 


ormulated t incipl account for difficulty 
wo principles to a 4 zi z , : 
y the number P red coreey in 10 seconds. The directionality principle 


States that an ordering from better to worse is easier than one from worse to 

a ’ à 
etter, Problems were constructed in terms of men’s first names, the question 
cing th . A and C. In accord with the directionality prin- 
“Ing the relation between A 4 4 D we hener deen C wae 
ciple, problems of the form A is better than B and B is better t 


Solved 5 l dents in the time limit, while those in the 
7 ege stu 
SET pommel se than A were solved by only 43 per- 


orm i } is wor: 
cent, C zi serie ne tne seh has omahing in einem with 
the end anchoring of judgments, to be discussed in Chapter aie oie at it is 
elpful to the subject if the first term is an end a a he or es me 
80 that the premise proceeds from an end towar ; e n a er than 
from the middle toward an end. Thus A is better than m : is sate 
than B was solved correctly by 62 percent while B is better than C and B is 
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worse than A was solved correctly by only 38 percent. The D a: 
success for other problems in which one principle helped w hile the ot Fa 
hindered fell between these extremes. These two principles of spatial paralogic 
therefore seemed to agree quite well with the data from problems written i 
better and worse, even though the subjects were college students capable o 
symbolic operations. Sagem 

To get more direct evidence of the spatial representation in cognitiv spara 
De Soto, London, and Handel had their subjects locate the elements o! 
comparative statements on paper, in boxes printed horizontally and vertically. 
As expected, better was consistently represented above worse, while the rep 
resentations of lighter and darker were spatially inconsistent. PES 

Such findings lead to predictions about the difficulty of different relations, 
so De Soto, et al., compared problems written in better and worse with prob- 


: 4 i 
lems of the same forms written in above and below. As expected, the pe 
centages of success were similar because 


these problems are represent 
similarly in cognitive s 


oat ela ‘cos and 
pace and so the difficulty of aligning premises a 
s ; y k m otter were 
conclusion is about the same. For example, problems written in better we 
easier than those written jn worse, 


5 P 1° vere 
and problems written in above w 
easier than those written in below 


because, according to the authors’ we 
pretation, the natural tendency is to represent these relations spatially ape 
proceed from above downward. “Worseward is downward [p. 518].” The par 
tern of percentage correct was not the same for problems written in nge 
and darker, however. The problem A is better than B and C is worse than c 
was much easier than the parallel problem A has lighter hair than B mib 

has darker hair than B. Apparently the clear spatial representation of better, 


R —eliminate? 
worse, above, and below—especially when the ends are anchored—elimina it 
the presumed difficulty of the revers 


appears that the third term to be re 
tured im 


ue spectively 

al of one proposition. Introspective’) ne 

. . jg struc” 
ad is simply added to an already 


s ; $ ial imagery 
age. Problems written in terms not conducive to spatial imager) 


> ar versal in” 
must be handled indirectly, presumably linguistically, hence a reversal 

creases the difficulty. f 4 

aes ; s ni ors 

Problems written in left and right did not conform very well to the auth 

, : D a con 

eer about difficulty. One might guess that in these cases the i e 

usio j i : ; i introduc 
fusion etween a spatial representation and reading habits could intro 

inconsistencies, 


, y with 
There are other abstract relations that can be represented spatially, “ 
the expected effects o 


A 
don, and De Soto (19685 ove follow-up experiment by Handel, Lo 


ly, as the better-worse relation was. 5 aii 
horiz i 1 others ag@¥" 
orizontal representation, and 


oe age and farther-nearer, were inconsistent. As expected, the “nen 
a reulty of linear syllogisms written in father and son, better and, S le 
and more and less were all about the same. In the authors’ words, “PeoP 
have the ability to orde 


j xo dimer 
ie hkk r elements in a cognitive space, of at least mo dif- 
ions, which has properti : 1 » The 
Properties not found in geometric space [p. 357]. 
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ficulty of problems involving relations that are naturally represented in this 
cognitive space can be predicted from their spatial representation, while those 
involving other relations apparently depend more on linguistic complexities. 


GRAMMATICAL-SPATIAL CORRESPONDENCE 
A different, but overlapping, interpretation of such findings by Hutten- 


locher (1968) was based on her previous observation of children’s difficulties 


in arranging blocks on shelves, a concrete form of the three-term series 


problem. If blocks A, and Az are in place and block B is to be placed in 
the middle, the descriptions can be stated with B as the subject (B is above 
Aj; B is below Az) or as the object CA: is below B; As is above B). It seems 
to be more natural to think of the mobile block, B, as the subject; and, in 
fact, when B was described as the object, there were many more errors and 
correct placements took longer. When a toy truck pushed or pulled another 
toy truck, it was easier to place the mobile truck if it was described as the 
subject. But for passive statements, as when the truck was pushed by an- 
other, it was easier to place the mobile truck if it was described as the 
object. Huttenlocher’s generalization is worth serious consideration: Com- 
prehension requires a correspondence between the form of a linguistic 
expression and the situation it describes [p. 553].” : 

To apply this interpretation to college students reasoning about three- 
term series problems, Huttenlocher arranged series which compared people 
by height: Tom is taller than John; Sam is shorter than John. Assuming 
that the first premise establishes the relation between the first two terms, 
Tom and John, the hypothesis was that the third term, Sam, is more ne 
located when it is the subject of the second premise, as 1n the above et e, 
than when it is the object, as in the following example: Tom i a ree 
John, John is taller than Sam. Various combinations of premises with boys 
students, followed by the question Who is 
y easy tasks, of course, and the 
do show some differences. When 
were fewer 


and girls’ names were given t 
taller? or Who is shorter? These are ver 
response times were brief, but the results ) 
the third term was the subject of the second premise, po k 
and times shorter than when it was the object. These problems, written 


i i itive space by spatial imagery, 
in talle 7 be ordered in cogn! y 1y 
# 5d sarten; waa the two sets of data on relative 


according > al., and enerally eae oe 
Uel of ilena ee recite But the end-anchoring interpreta- 
tion for certain comparisons is replaced by the ae eh D i 
ia) arranged by Hutten oc er with ver s that 
ding John, and a passive form, John is led 
is first? or Who is last? This new vari- 
s to be compared, but by and large the 
ded on the grammatical status of the 
form of the premise. As in the above 
hen the mobile term was described 
e object. Passive premises— 


Three-term series were al 
have an active form, Tom is lea 
by Tom, the question being Who 
able doubled the number of serie 
difficulty, in terms of errors, depen’ 
mobile third term and the grammatical 
Experiment, active premises were easier W n 
as the subject than when it was described as t 
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i ile rm was 
i i —w when the mobile terr ? 
which take longer in general were casier at piene Vea fen 
described as the logical subject even though it was the grammatica 
nile i i in which the mobile third term, 
An illustration of the most difficult type, in which the s 
i bject but the loøical object, follows: Ton 
Sam, is the grammatical su ject but the logica ; 
leading John, Sam is led by John. 
Huttenlocher’s experiments and inter 


; Ta > her sub- 
pretations were guided by her 
jects’ introspections, revealing sp 


; > li istic 
atial imagery, but it was the ote 
i Bei ce ial imagery 
forms that were experimentally manipulated. Evidence of spatial : oe 
. 5 . 3 w 3 a or 
was not directly obtained in these experiments, but the evidence 
influence of grammatical structure is quite convincing. 
LINGUISTIC PROCESSES 


Three Principles derived from linguistic theory were i a tee 
problems by Clark (1969a). The Principle of the primacy of Sah Hee 
relations asserts that the conceptual relations out of which sengeners iiy 
constructed are stored, immediately after comprehension, in a more e a 
available form than other kinds of information, such as the superficial ae 
matical structure of the sentence, Underlying John is better than : va 
for example, are the base strings, John is good and Dick is good, Peh ma 
be transformed to John is more good than Dick is good, or John ae ia 
than Dick is, and so on. Comparative sentences, then, include the song 
relations, such as John is good or John is bad, as well as the ara ee 
information; but the first kind of information is more available. ep oat 
John is worse than Pete, the listener realizes that John and Pete are bad m 
readily than that John is more extreme than Pete 

The principle of lexical mark 
like good. Good and bad are 
jectives, but good is also use 
general. The good unde: 
interpreted cithe 
contrastive sense 
be interpreted in 
than Jack can on 
cerned with 
(1970) h 


tion between more and 
they 


in badness. Bees 
ing Separates two senses of certain cy spon 
used in an evaluative sense, as contrasting sa 
d in a nominative Sense, to refer to goodness ee 
tlying the sentence John is better than Pete aa 
r nominally or contrastively, but since the lexically mark 
takes longe | 


É J A $ sak is worse 
the nominative sense. The bad underlying Dick is 


asked to make various choices of Or 
nonym of more, apparently because the nominat 

e the contrastive, 

of congruence 


Š re asserted 
simply asserts what others have a 
8°, namely 


» that answe 
surface question, 
of functional rel 


an 

ring a question requires mor a 
It requires information that = for. 
ations, with the information asked of 
ark constructed 32 problem ci oe 
estion forms, as listed in Table 6- 5 
search on three 


covered in previous re 
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If John isn’t as good as Pete, 

And John isn’t as bad as Dick, 

Then who is best? 

Dick Pete John 

The principles of primacy of functional relations and lexical marking 
predict that Problem Types I and II’ will be solved more rapidly than II and 
iy respectively. Table 6.2 shows that, when college students did these 
little tasks, the predictions were confirmed. The principles of primacy of 
functional relations and congruence predict that II] and IV’ will be solved 
more quickly than IV and III’. These two principles also predict that 
solutions will be faster for I and I’ when the question is Who is best? and 
for II and I’ when the question is Who is worst? The data of Table 6.2 


TABLE 6.2. GEOMETRIC MEAN TIMES (IN SECONDS) FOR SOLVING THREE-TERM 


SERIES PROBLEMS 


Form of Question M Overall M 


— . 


Form of Problem 
Best? Worst? 


Ca) A better than B; B better than C 5.42 ead = 5.49 


Cb) B better than C; A better than B 4.98 5.52 5.25 


Ca) C worse than B: B worse than A 6.27 6.53 6.40 S 


H 
Cb) B worse than A; C worse than B 5.93 5.04 s 
Ca) A better than B; C worse than B 535 534 5:34 
1 om 
Ch) © worse than B; A better than B 4.84 5.84 5.32 
9 
py ©) B worse than A; B better than C Re EN Oe 
Cb) B better than C; B worse than A 6.12 5.45 5.77 
=o 34 
P Ca) A not as bad as B; B not as bad as C 6.77 5.95 6.3: ee 
Cb) B not as bad as C; A not as bad as B 7.16 6.56 685 
5 6.63 6.08 
Ir Ca) C not as good as B; B not as good asA Rag 6.22 
6.35 
Cb) B not as good as A; C not as good as B ie 
6.66 6.50 
lp Ca) A not as bad as B: C not as good as B ee : À 6.52 
Cb) C not as good as B; A not as bad as B RO im “i 
6 6.18 6.14 
Iv Ca) B not as good as A; B not as bad as G SW 6.19 


Cb) B not as bad as C; B not as good as A 5.48 7.12 6.25 


From Clark (1969a). 
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are in general agreement. Linguistically negative statements ne P 
more transformation than logically equivalent positive statements, me = 
comparison of the upper and lower halves of the table indicates oe = 
tional time was required. Clark's three linguistic principles do not a 
for all the differences in Table 6.2, such as those between the a and b 
variations in I, II, I’, and II’, but the interpretation is more complete ein 
previous ones had been. And a similar pattern of difficulty values was o ‘ 
tained when difficulty was measured in terms of proportion of solutions 
within a 10-second time limit (Clark, 1969b). 

Clark’s data agree with those of De Soto, et al., in most respects, but the 
interpretations differ. The relations that seem, according to spatial = 
logic, to be easily visualized in a vertical direction are the ones that Clar 
believes are unmarked, hence the principle of lexical marking and the 
principle of directionality would make the same predictions. The special 
case of deep-shallow is critical because deep is represented spatially as below 
shallow; thus, according to spatial paralogic, problems written in deeper 
should be harder than problems written in shallower. But Clark’s data in- 
dicate the reverse and, since deeper would be unmarked, his principle of 
lexical marking would be in agreement with this. , 

Clark’s data agree with Huttenlocher’s in most respects, but Clark's 
interpretation, unlike those of Huttenlocher and De Soto, et al., includes 
the negative statements, Introspectively, the spatial assignment of negative 


. 5 fj . assive 
Statements is confusing; on the other hand, statements written in passi 


form may be more adequately handled by Huttenlocher’s principles than by 
Clark’s. 


With Clark's analysis, the interpretation of three-term series problems 
has moved from spatial representation to grammatical-spatial correspondence, 
to a linguistic interpretation that makes spatial representation unnecessary: 
It is still possible, of course, that spatial representation is involved under 


+ aoe . ; i i n- 
some especially favorable conditions but is abandoned for linguistic reas° 
ing when conditions are not fav 


baat ing 
orable. As in the case of syllogistic reason" & 
differential training may assist 


d à taa when 
7 the choice between alternative theories Wh 
a mere tabulation of errors and times does not suffice. 


The Strange Case of the Awkward Negative 

Te special difficulties that negative information brings to inductive 
reasoning (Chapter 2) cause trouble in deductive reasoning ‘also, as we have 
seen in the solution of syllogistic and three-term series problems. Althoug? 
negative instances do have a Special value in the case of disjunctive classes; 
in the more common cases they are troublesome and therefore deserve 
examination. Suppose we are concerned with a closed system in which we 
know that an event is determined by either X or Y. and we have 4 hy- 
pothesis that it is due to X. This hypothesis can be evaluated directly» by 
trying X to see if the outcome is positive, or indirectly, by trying Y t° se? 
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if the outcome is negative. But negative information is not used in the same 
way as positive information, even when the two are logically equivalent. 
There are analogies in other areas of psychology. The effects of punish- 
ment are not equal, in the opposite direction, to the effects of reward. 
Unpleasantness is not merely the other end of a scale of pleasantness. In 
the case of deductive reasoning, the important factors to be considered in 
analyzing the function of negatives are the logical propositions, the language 
in which they are stated, and the nature of the logical system or universe 
that is assumed. Some ingenious experiments have been designed, mostly 
in Great Britain, to isolate the effects of these factors. The first mentioned 
here are based on single problems that require 15 minutes or more, and the 
others on a series of short tasks in which speed of processing information 


is emphasized. 


PROBLEM SOLVING 

An experiment by Donaldson (1959) is a good illustration here. Grade- 
school students were required to construct a system of statements that would 
give sufficient information to match five objects CA, B, C, D, E) to five 
other objects (1, 2, 3, 4, 5) In one problem, for example, boys named 
Arthur, Bob, etc., were to be assigned to schools numbered 1, 2, etc. The 
subjects were told to construct an economical system with a small number 
of statements and, furthermore, that positive statements, such as A goes 
with 1, cost five points while negative statements, such as A does not go 
with 2, cost only one point. Thus in the best solutions, a ——- 
would be explicit and the positive matches would be derived ~~ , 

In general, the subjects avoided positive statements, as instructe 2: ough 
they wrote more than the optimal negative statements. The introduction 
of one or two positive statements, in spite of their high cost, made later use 
of negative statements much easier. More than half the subjects were able 
gradually to eliminate all the positive statements and ee write a 
set of negative statements only. Similar results were obtained with college 
students under somewhat different conditions. 


An interesting difficulty showed up in second-order derivation. Stating 
a É es indirectly that A is 1. Furthermore, 


that A i 5 determin j 
apni dag A is 1 can be derived secondarily the 


fron aoe 2 . | t 

n the positive derivation tha ó 

negative Eteni that B is not 1. However this iy of pened 
j B . ğ =< 

tions was generally avoided. While the instructions emphasized deriving 
ò aag ee 


Positive statements from negative, @ positive match, once derived, was 
a 

treated with finality, as an end in itself, rather than as a step to further 
reasoning , i 

g Pi z th j 

The problem that Wason (1960) employed. to arg k : P f 

quired generating a rule from particular pair eben h bios ‘ger A 

; 5 s Š i iliza- 

inductive reason but the important guenon wee See Be utiliza 

5 > hat could eliminate a hypothesis. The instruc- 

Il be given three numbers which conform to a 

g 


_, Your aim is to discover this rule by 


tion of negative examples t 
tions were simple: “You wi 
Simple rule that I have in mind. » 
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ai dhiar i een for vour choice 
writing down sets of three numbers, together with reasons \ 
[p. 131].” A positive instance, 2, 4, 6, was displayed as a starter. n 
: j i i i g = ree YS ii 
The rule that the experimenter had in mind was “three numbers o 
creasing order of magnitude.” The trivi 


ality of this rule for college stu 
` lts, since they wrote many sets 
exemplifying more complex rules that were positive instances, ha ies 
confirmation was not helpful in narrowing down the possibilities } S 
subjects seemed to avoid negative examples that could climinate a A 
pothesis, but those who were most eficient wrote more negative examples 
and thus eliminated alternative hypotheses quicker. 


dents may have some bearing on the ri 


Campbell (1965) followed up these experiments by one with word pe 
lems which were to be solved bv replacing the letters of the words with 
other letters. Problems in which the substitutions could be reasoned gut 
directly were easier than others, involving the same words, in which the 


See ey ek à C alterna- 
substitutions could only be made indirectly, by the elimination of alteri 


s em s 3 E E OO in 
tives. The difference was found even in groups given brief instruction 


indirect procedures, 
Campbell attributes part of the difficulty to the nece 
complementary classes, since the re 


ty for dealing with 
‘asoning required consideration of A het 
only as A but also as not B, not Ç, ete. Piaget's children had trouble (see 
Chapter 3) in understanding that primroses and not primroses were in 
cluded in the supraordinate class flowers, 
had to do this incidentally while solving 
difficulty, according to Campbell, was the 
from A is 1 to B is not 1 
positive statement, 


» and Campbell's college —— 
a problem. Another factor in i 
change of logical subject. To infer 
involves not only going beyond the finality of a 
but also shifting to a statement on another subject. } 
All the above evidence on the sequence of terms and the role a 
negative is compatible with the assumption that there is an upper limit is 
the amount of information that the individual can handle and that as he 


7: š Tea Sims i f ma- 
approaches this limit, he makes errors. Presumably, he retains the informa 
tion that first comes to him whil 


to it. If he can relate the 
solution is easy. But 

negative to affirmative, 
load may a 


e taking in new information to be —— 
new information directly to what is being retaince, 
if he must transform the new information, as from 
or reverse the order, 
pproach his capacity 
of infomation may, ; 
many proble e 


or change the subject, the rn 
and errors may occur. Transforming one 
as a guess, be as much work as retaining two bits, anc 
ms require simultaneous retention and transformation. 


INFORMATION-PROCESSIN 


G TIME 
The processing of affirmative 


O” z com- 
and negative information can also be 


at 3 e 5 ittle else- 
pared by uming performance on simple tasks which require little 
These are not full-scale reasoning 

S 


fs me 

= problems, but the results clarify is E 
of the difficulties in handling the information in the more complex probte à 
described above. Wason C1959), for example, exposed simple displays 
information together with four types of statements to be aligned with them. 
E: quire “ae j ma- 
The time required for true affirmatives was 8.99 seconds, for false affirms 
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tives 11.09, for true negatives 12.58, and for false negatives 15.17 seconds. 
Possible explanations, according to Wason, are a habitual set for positive 
statements, the inferential nature of the negative information, and an emo- 
tional response to negatives. One subject said: “I don’t like ‘not— it’s a 
horrid word.” 

Following up these possibilities, Wason (1961) made use of odd and even 
numbers to construct binary statements in which the probabilities are equal 
Seventy-eight is an odd number. Fifty-seven is not an even number, There 
were four types of statements: true affirmative, false affirmative, true nega- 
tive, and false negative. The subjects, students at London University, needed 


only a few seconds to verify these statements as true or false, but they veri- 


fied the affirmative statements more quickly. 

Such tasks can also be written in completion form, 
types of statements. “Complete this statement with a number that will make 
the statement true: — is an even number.” “Complete this statement with 
a number that will make the statement false: — is not an odd number.” 
like the verification task, response was faster to 
and, in addition, response was 


with the same four 


On the construction task, 
affirmative than to negative statements, 


faster under “true” than “false” instructions. 
In a binary task, the negative can easily be changed to the affirmative, 


just as the false can easily be changed to the true. But the results showed 
that the negative introduces more difficulty than falsity. Instructions to 
make a sament true are superfluous, since this is the habitual thing to 
do, but instructions to make it false must be retained in memory. And 
the completion of a negative statement as true seems to require also thinking 
of the contrary of the whole statement. Completion of a negative statement as 
false could casily be done by canceling the negative and the false components, 
but the subjects apparently preferre time-consuming 


d to perform two separate, 
Operations. There is a curious inconsistency, perhaps only superficial, be- 
tween these effects and the atmosphere 


effect which makes a negative con- 
a an affirmative one. 


Results such as chese led Wason and Jones (1963) to draw a distinction 
d not in reversing the truth 


between the denotative function of the word | ee 
value of an affirmative statement and the possible porin ii or 
connotations of this familiar English word—which may be different in 


other languages. The methodological problem was to eliminate be connota- 
tions of not, ‘and once again the valiant nonsense syllable nar psychology 
from the coils of meaning. One syllable was assigned as a sym ol for affirma- 
tion and another for negation. Statements such as the four below were 
shown to the subjects, with a tick indicating true and a cross indicating 


false on the first four only- 


clusion more acceptable thar 


7 MED an even number “ $ 
4 MED an even number X 
8 DAX an even number “ 
3 DAX an even number X 
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Some subjects checked a large number of such implicit statements, = 
others checked corresponding statements containing the explicit is a rs aS 
Although a few subjects reported that they translated the ra 3 a 
into explicit meaningful language, the overall results showed that ma : te 
ence between responses to affirmative and negative statements was smalle 
for the implicit than for the explicit format. Wason and Jones os ae on 
the basis of such data, that the purely logical function of denial, w hen 
symbolized by an innocent nonsense syllable, makes little difference ¥ 
time of response to affirmative and negative statements; but conventiona 
verbal interpretation of the negative does make l Be 
the specific effect of the stimulus not makes an additional difference. in 
data can also be explained by Clark’s (1969a) principles of congruence and 
lexical marking, assuming that false is marked and true is not. 

A related logical effect turns up as a consequence of instructions to make 
exceptions. If we have 2 classes of items, A and B, i 
mark the As directly or we can ask them to mark all except the Bs. Jones 
(1966a) compared these instructions by having students, aged twelve to 
fourteen, perform a canceling task with rows of 8 digits, specifically, the 
digits 1 to 8. The instructions were stated by inclusion, such as: “Mark the 
numbers 3, 4, 7, 8,” or by exclusion, such as: “Mark all numbers except 
1, 2, 5, 6.” Performance under exclusion instructions was considerably 
slower. The errors were not much different in total numbers, but they 
tended to be errors of omission under inclusion instructions and false 
positives under exclusion instructions. To be sure, such instructions, though 
symmetrical, are rather unlikely; the excluded class is usually smaller 
than the included class, as in “The library is open evenings except Satur 
day.” When Jones compared instructions to mark 5 digits with instructions 
to mark all exce Ferences of the same kind were obtained. 

The aw exceptions suggests that the instructions might 
nes (1966b) told other students, of the Si 
numbers except 3, 4, 6, 8, etc., emphasizing 


* g a 
S, and then, after thorough instructions and 7 
the students: 


ents given the exc] 


a significant difference, and 


we can ask subjects to 


È : o. 
“Tell me what you are going to d 


as students given the ee 
ere given exclusion instruction? 
rmed much more slowly. Surely 
variations to be tried out. There 


must be a poi shi ae 1G 
point at wh structions are more efficient. Pe 


haps with such ratios, 
instructions w 
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to verify. The rules were made by pairing 1 value of 1 attribute from each 
figure with a logical connective. The connectives were either conjunctive 
Corange and small) or disjunctive (orange or small). Each rule was trans- 
formed by negating the first, the second, or both aitributes. For example, 
the conjunctive statement orange and orange underwent 3 logically equiva- 
lent transformations: Not-green and orange, orange and not-green, and 
not-green and not-green. The times required to verify such conjunctive rules 
by comparing them with the figures in view were, respectively: 0.634, 0.736, 
0.736, and 0.824 seconds. Thus these results are similar, in principle, to 
those obtained by Wason (1959). But Trabasso went further and assumed 
that the difference between 0.736 and 0.634 was the time required for one 
transformation. Adding this difference to 0.736 would estimate the time 
for 2 transformations, 0.838 seconds, which is very close to the obtained 
0.824 seconds. Other calculations of this type, all with the data of one well- 
Practiced subject, working in 2 sessions a day for 32 days, indicate that 
Storage of a rule requires more time than verification and that, as Wason 


reported, confirmation is easier than disconfirmation. 


OPEN AND CLOSED SYSTEMS ; 

One error that is emphasized in the above research is the failure to make 
a positive inference when the alternatives are negated. It is a pate. com- 
ment on trends in psychological research that a quarter-century 380 vea 
(1945) was warning psychologists against a prevalent error ak e opp 
direction, under the title: The Credence Given to One Hypot = I cae! 
of the Overthrow of Its Rivals. The error that Morgan seri is base 
on the distinction between a system in which the possible alternatives are 


i so limited. 
knowr imi d stem that is not limi 
n and a sy : 
io pe limia j lustrating a closed system, informed the sub- 
s J 


orean = Ani problem, dU f which would ring a bell, then gave them a 


jects about five buttons, one o 
series of questions: ae Di 
oy many chances are there in a 100 that the bell will ring if you press 
uo einn Avand ‘the bell does not rings os a ea chances 

are there in a F00 that the bell will ring if you press butto P- 
vritten around a murder and five men who had 


s and two other examples illustrating open 
' tudents following the bell problem. 


i ir probability estimates on the bell 
Two-thir students raised their p no ao 
PR a of = the five alternatives were successively eliminated, but 
m as four a 


; the five murder suspects also. 
h i n the case of 3 
alf of them did the same i several criticisms, to be sure, but more 


ae of this ep oe E hae the general trend, that intelligent 
re yi ro s 

adie ofen anme a closed gema of ematia, Whether waranced or 
not, and reason from the negation Skon aE RR ot tee ‘al a 
circumstances under which this type of To EN a oe Si 
in such detail as the complementary error describe j 


Similar series of questions, V 
been rounded up as suspect 
Systems, were given to college stu 
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Implications of Conditional Statements 


Conditional sentences, common in scientific reasoning, do — 
categorical assertions but describe hypothetical relations. If X is a fis se 
can swim. If Henry passes the aptitude test, he gets the job. If i 
is studying problem solving, she is studying psychology. Reasoning oF : 
sort is an extraordinary, unworldly venture: describing hypothetical pore’ 
talking about them in conditional terms, and proceeding, on the baait : 
consistency, to logical consequences that would be true if . . . It is not a 
prising that errors occur. Actually, some of the implications of ene 
statements are quite straightforward and easy; others give rise to severa 
identifiable logical mistakes that demand psychological interpretation. 

To clarify conditional sentences and to separate the formal structure from 
the content, logic books often represent statements of the form, If P, then Q, 
as P—> Q (Ehlers, 1968). P is the antecedent proposition, while Q is the 
consequent, and the negation of these can be represented by P and Q. 


| ; , bears Ss eeh and 
Here is a valid argument, affirming the antecedent, written in English an 
in logical symbols: 


If X isa fish, then X can swim. P3=Q 
X is a fish. 


P 
Therefore, X can swim. v2) 
A valid argument that denies the consequent: 
If X is a fish, then X can swim. PQ 
But X cannot swim. OQ 
Therefore, X is not a fish. g 


A common argument, 
the antecedent: 


If X is a fish, then X can swim. P>Q 
But X is not a fish, 


P 
Therefore, X cannot swim. O 


affirmation of the consequent: 
If X is a fish, then Xe 


PQ 
X can swim. 


Therefore, X is a fish, : Pp 


i P ; ial of 
which is fallacious because it depends on denial 


And another fallacious argument, called 


an swim. 


When the conditional statement P 
minor premise, P, Q, P, or 6, can be 
as in the first two ex oning also proceeds in the other 
direction. Various combinations of p Q, P 5 


s a 
$ ` o se, 4 
->Q is taken as a major premis 


A duced 
and a conclusion can be deduced, 
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of the rule is quite difficult for intelligent adults without training in logic. 
One device that logicians have found to be helpful in evaluation, a truth 
table, is shown in Table 6.3. This table is to be read as follows: in the first 
case, when P is true and Q is true, the statement P— Q can be affirmed. 
In the second case, when P is true and Q is false, the statement P > Q is 
false. The third and fourth cases are consistent with the truth of the con- 
ditional statement. 

With this little excursion into the logic of conditionals, we are now in 
a position to look at the empirical evidence. The fallacies of denial of the 
antecedent and affirmation of the consequent have long been recognized, 
and we know that college students are moderately vulnerable to the former 
and very vulnerable to the latter CWason, 1964). When we look for a 
psychological explanation of such difficulties, we see several possibilities. 
The intrinsic nature of the logical structure is not understood by all 
reasonable men and women. The sentence P—> Q may be read as a state- 
ment of temporal or causal relations although only logical relations are in- 
tended. \s we have noted earlier, logical forms grew out of the general 
acquired formal constraints of their own. Another 
s of conditional reasoning involves the semantics 
avor of then can be misleading. Even 


language but they have 
hypothesis for the difficulties 
of the enigmatic If. The temporal fl 
such terms as antecedent and consequent have temporal connotations that 
may be logically confusing which is one reason why logicians prefer their 
astringent symbolic language: P— Q. Still another hypothesis is the difh- 
culty, noted above, of negation and falsity. A general set for truth seems to 
be more reoular. Hence P. C. Wason, P. N. Johnson-Laird, and associates 
at Universit, College, London, undertook a series of analytical ig eon 
to tease out the various psychological factors that produce these logica 


difficulties. 


The materials of sever: ee ee i aes 
side and letters on the other, and the procedure aas a anaron of the 
s used in studies of concept attainment CWason, 1968). 


were placed before each college stu- 
as to evaluate the validity of the rule that if there 
ita, Doni one eile bane card, there_is a 3 on the other side. (In logical 
Symbols, th ard were P,Q P, and Q respectively.) The subject was then 
S) $ the cards were P, S fr 9m2 

asked which cards he would turn ove 


re cards with numbers on one 


al experiments we 


selection procedure = 
Four cards, showing D, 3, B. and 7, 


dent, with instructior 


r to determine whether the rule was 


TABLE 6.3. A SIMPLE TRUTH TABLE 


Case P Q P>Q 


mam 
Hama 


} 

2 T 
3 F 
4 F 
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true or false. Every subject selected D, of course, because it was ao. 
that a 3 on the other side would confirm the rule. Selecting 7 would oe 
equally informative selection because a D on the other side aot l a a 
the rule by denying the consequent. In fact, Table 6.3 shows that sal 
joint occurrence of O and P, in this case 7 and D, is the only contingency 
that falsifies the conditional. But few subjects made this selection. Knowing 
from the research summarized above that negatives are awkward, Wason 
suspected that most people are more accustomed to truth than to falsity 
and therefore find it awkward to falsify a rule by denial of the consequent. 
This appears to be a promising interpretation, even though SS 
emphasis on falsity did not improve performance, compared to that of a 
control group. = a 

Denial of the consequent implies denial of the antecedent, or QP, 
but to many people denial of the antecedent appears irrelevant to the argu- 
ment. If X is a fish, X can swim, but one could say that If X is not a fish, 
all bets are off. This is an ancient way of interpreting conditional sentences, 
out of place in a two-valued logic which insists that propositions be true 
or false, but it may explain what many people actually do. Therefore, 
Johnson-Laird and Tagart (1969) asked how college students would evalu- 
ate conditional statements, given letters and numbers on cards as in the 
Wason experiment above, when judgments of true, false, and iene 
were allowed. When P Q cards were presented, the conditional was judge 
true; when P Q cards were presented, the conditional was judged false— 
both valid inferences. But when P Q 
judgment was usually irreley 
tion that the subjects w 


or P O cards were presented, the 
vant, The data were consistent with the rag 2 
ere operating with an incomplete truth table, like 
Table 6.3, but with the “truth value” of cases 3 and 4 considered irrelevant. 
In other words, the subjects did not grasp the intrinsic nature of the condi- 
tional argument and missed the information contained in half the instances- 

The experiment included, for comparison, the disjunctive form: Either 


ae : 5 ah an 18 
there isn't P, or there is Q. This expression has the same implication ` 
denial of the antecedent in the conditional form, but the disjunctive forn 
seldom led to the judgment of irre 


levant. The disjunctive form was the one 
on which the most time was spent, hence it is possible that the awkwardness 
of this expression slowed down the subjects and made them approach a 
logical task more analy 


Ytically, Following up such leads, Wason and Johnson” 
Laird (1969) studied 


E 5 2 h: á nts 
i disjunctive rules systematically, asking college stude 
to select instances to show i 


that a rule was false and, in other examples, t° 
show that a rule was true, These expressions, like others, were difficult © 
reason with when negated but, more important reasoning with disjunctive 
expressions was more efficie : 


5 A itiona’- 
3 nt than reasoning with denial of the conditio: 
The Either .. . or sentence 


i ore 
5 z itself creates uncertainty and demands m 

analysis than the conditional If , - then sentence. Say the authors: 

vs Bates g ith a 

With a conditional the individual ig likely to be confident but wrong; with 

disjunction he is more likely to be unco 


mfident but right. The meaning of a 
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conditional gives no hint of the negation or falsity which underlies its logic. 
The disjunctive expression makes this element explicit, but this seems to weaken 
the grounds upon which any inference can be made [p. 20]. 

Johnson-Laird and Wason (1970) have offered a tentative but plausible 
these cases, based on the above inquiries 
differentiate the selection process, which 
n, from the evaluation process, which 


description of reasoning processes in 
and several others. It is necessary to 


is turning over a card to seek informatio 
is a judgment about the effect of the information on the truth or falsity of the 


rule. The naive subject, who has not memorized the truth tables of the logic 
books, has a kind of incomplete truth table derived from his implicit knowl- 
edge of the language, which contains information only about those contin- 
gencies that make the rule true. He has to infer from this information which 


contingencies would falsify the rule when his incomplete truth table creates 


a bias toward verification. Thus he considers only cards mentioned in the rule, 


namely, P and Q, and tests only whether they could verify the rule. Those 
subjects with partial insight are likely to ask whether any cards, other than 
those which could verify the rule, could falsify the rule. Those subjects with 
complete insight ask whether any cards could falsify the rule. According to 
these authors, as insight into the logical structure of the conditional sentence 
is acquired, more reliance is placed upon tests of falsification. — 

In general, then, the difficulty of reasoning about conditional sentences 
arises out of their hypothetical nature. The thinker has to process proposi- 
tions that are not only abstract, like all general propositions, but are also iffy. 


Even propositions that appear irrelevant to the uninitiated have logical 
Consequences. This hypothetical characteristic is complicated by linguistic 
factors because If . . . then constructions, being common 1n ordinary discourse, 

And, although informa- 


appear more straightforward than they logically are. ne i 
tion that falsifies a statement is logically valuable, the processing © such in- 


difficult because most people are set for truth—in this 
the logical structure of conditionals 


the consequences of falsification. 


formation is especially 
Situation as in others. Thus, grasping 
Consists largely of attaining insight into 


Learning to Reason Logically 

Reasoning i ultivated performance 5 z 
mind in a rat Aa is not iain irresistibly to eae mR 
to reason depends on, and goes beyond, the learning 0 — i ogical 
Connectives—such as if, then, and, or, and so e pe Se tonal as class 
Concepts—such as tree, sonnet, and gle ee ouble borders—are 
earned. Rather, they are learn «t of sentences, as functional 


ed in the contex 
Parts of statements. Learning t° reason also depends on, and goes beyond, 
learning the general language in which statements On jade: Part of the 
training in A og consists of practice ny WES Minila concepis and 
l 8 tically; but a large part consists also of learning 


e that requires practice. A sound 


a 
Nguage structures more Cl 
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new terminology (e.g., conversion), new formal structures Cogs ee 
> apping statements into truth tables, and falsific 

and new procedures (e.g., mapping statemen 

Be training is effective even in small amounts. It is not surprising fhar” 
college course in logic improves skill in handling logic problems, “0s =i 
young men have acquired some facility in the calculus of pepe ae se 
two hours of instruction (Moore and Anderson, 1954). A game p agen nth 
logic cubes is said to change the drudgery of learning the calculus of Le oe 
tions into an enjoyable pastime even for schoolchildren (Allen, | F 4 
Special instructions with diagrams and examples helped a group of ad 5 = 
reach a high degree of accuracy in syllogistic reasoning in a short time anit ; 
demonstrate that the distorting effects of bias can be minimized (1enle anc 
Michael, 1956). In Hunter's (1957) study of three-term series problems, i 
improvement appeared between the first and last halves of a set of 16 abe ~ 
lems. Such improvement would not occur, of course, if formal reasoning were 
a routine performance that everyone practiced daily. Nevertheless, n 
people do learn such skills by themselves, and some make better use of W = 
they have been formally taught than others. Hence, when intelligent adults 
who have not studied logic are compared on reasoning tests with those wl 
have, the averages differ as expected, but there is considerable overlap be 
tween the groups (Morgan and Morgan, 1953), 


From the psychological point of view, educ 
bles educ 


ation in logical reasoning an 
ation in other systematic disciplines. It takes off from the genera 
language and consists of learning special concepts, operations, and papae 
Practice in putting verbal patterns into classes, such as conditional propos 
tions, and naming them facilitates m 
manipulating the classes 
such as conversion, are 
analyzed and the argume 
and restrictions of form 


astery of the concepts and skill a 
as units in further reasoning. Rules for mg? 
memorized and practiced. Ordinary discourse ba 
nts rearranged into formal patterns. The ae ae 
al logic are distinguished from those of ordinary SEN 
munication much as, in other disciplines, fine art is distinguished aa 
applied art and pure science from technology. h- 
Many of the difficulties described in this chapter can be reduced by te* r 
niques that make abstract relations more concrete. Euler’s circles offer a 
example of a visual aid that displays relations of inclusion and exclusion " 
volved in syllogistic reasoning. t 
S is P is shown by pl 
circle to indicate” 


7 $ sition 
In the illustration below, the paps the 
. < å . x S e 
acing the S circle inside the P circle and P outsid 

5 stem, 


the not-P space. Venn diagrams, another graphic jee bv 
and more generally useful; and truth tables Cillustra i 
: re > i lations: 
widely exploited as an aid in analysis of logical relation 


are more complex 


Table 6.3) are 
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Psychologically, all these schemes may be considered alternative representa- 
tion systems for illustrating abstract logical structures on a two-dimensional 
surface. Teachers use these, as they use maps and models, to demonstrate 
relations between elements, and such demonstrations have been helpful 
to third-grade children working three-term reasoning problems (Morgan and 
Carrington, 1944). But the most improvement occurs when students are 
rming or recoding the information from one repre- 
sentation system to another. Many of the exercises in logic books consist of 
practice in representation and transformation. Whimbey and Ryan (1969) 
have interpreted the results of a training experiment with college students 
by assuming that syllogistic reasoning depends on a good short-term memory, 
as tested by a modified digit span test, and that practice in representing the 
logical structures by Venn diagrams reduces this dependence. When the 
logical structure of a given problem is appropriately represented and matched 
to a familiar pattern, conclusions can be tested almost mechanically—much 
as complex algebraic manipulations are carried out by the mechanical appli- 
d algorithm. When all instances to be considered and 
tively, by any representation 
and of falsification are mini- 


given practice in transfo 


cation of a memorize 


all logical consequences are represented exhaus 
ative information 


system, the difficulties of neg 
mized. 
Since logical analysis of an 


of fallacies is a prominent feature of trainir 
includes the classification and naming of fallacies, along with exposition of 


the logical flaws. The procedures of Simpson and Johnson (1966), aimed 
specifically at atmosphere errors, were fairly successful. Wason (1964), how- 
ever, has worked out an ingenious procedure for getting the student to dis- 
cover and correct his own fallacies without the mediation of any technical 
terminology. The reasoning tasks were conditional statements of the form 


Tf P, then Q. One task. called the antecedent task, consisted of using the 
a statements to complete the following 


or more, will receive a salary of at 
one statement asserted that a 
-four received £1,800. It is tempting to infer 
y must be more than £1,800. But the next 
aged thirty-seven received £1,600. 
a statement suggesting a 
an invalid inference 


a critical purpose, the detection 


argument has 
ò f : wa 
ng in reasoning. Training often 


in a series of ter 
4 years, 
+ For example, 


information given 
tule: “Any employee aged 3 
least a_year [p. 30]. 
Particular employee aged twenty 
that if this is so, the critical salar 
r employee 


statement asserted that anothe 
arranged so that 


The order of the statements was 


valid inference was followed by a statement suggesting | oe 
consistent with the one before, then a statement suggesting a valid inference 
llacy, and so on. Thus college students 


inconsistent with the preceding fa hited een 
who drew invalid inferences were able to detect their own logica! errors and 
ene theni: ired completing the foll 
Another task, called the consequent task, required comp ge SOM: 
ino ; ý hi or more, will receive « 
ing incomplete rule: “Any employee aged š .” sina 
salary of at least £1,900 a year [p. 30].” A corresponding series of statements 
leading to alternate valid and invalid inferences was arranged. This task 
a . 
Was more deceptive. Fallacious reasoning from the consequent to the ante- 
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cedent was twice as common as from the antecedent to the consequent. bet 
on both tasks, experimental programs produced more recognition of scion 
than control programs in which successive valid and invalid inferences were 
not contradictory. It is interesting to note also that the improvement in the 
experimental groups occurred in an all-or-none manner. Those students who 
eliminated their errors did so quickly when the contradiction became aj: 
parent and did not err again. The small improvement in the control groups 
was more variable. Teachers of reasoning, as far back as Socrates, have con 
fronted students with their inconsistencies, but this procedure, which might 
by analogy with programmed learning be called programmed reasoning, is a 


: £ r is own 
systematic teaching procedure that allows the student to confront his ow 
inconsistencies and correct them. 


Self-contradiction was included also in 
Wason (1969) for teaching the best infe 


conditional reasoning tasks used by the London group in the research nie 
tioned above. We should bear in mind here that insight consisted mainly i 
grasping the logical value of falsification and that mere emphasis on falsifica- 


tion was not helpful. The training procedure first familiarized the subjects 
with all four combinations of instances: P Q, P Q, P O, and P Q. Then the 
experimenter chose cards and asked questions that confronted each subject 
with increasingly obvious contradictions until he realized that O was relevant. 
The success of such “discovery” procedures points to one way in which in- 
telligent adults without training in logic may have learned to reason logically. 


a training procedure devised a 
5 : ue he 
mation-seeking strategy on t 


Reasoning Abilities 


It is relatively easy to construct a reliable test of deductive reasoning, 3$ 
Burt (1919) did in England a half-century 
number of logic problems, like the problems i 
Propriate to the subjects. Such 
and success į 


ago, simply by assembling á 
n this chapter, of difficulty T 
tests predict academic success at all p 
kinds of problems. Good reasoners, in a 
motional bias in evaluating arguments thon 
- Scores on these tests correlate well with ied 
has long been considered an important com 


ficulties in test construction is that of defining 
is rather small: henc a ‘ i ane the fiimber of nen 
of deductive tetris ee no OF ihis domisi ene hard ida; an sts O 
most other abilities e aa constructed more systematically than te uct 
3 it is possible, as we have seen, to constr 
errors in a systematic fashion. Atmosphere 
T independent, and there may be others also 
a vi tag as well as deductive, is included. Analogies composed 


1S, and shapes have been used. The series-completion ite™ 


THE DEVELOPMENT OF REASONING * 265 
based on numbers or shapes, is a favorite. Arithmetic reasoning has long been 
distinguished from facility in arithmetic computation and treated as a com- 
ponent of general reasoning. Scores on tests of general reasoning correlate 
with general intelligence, of course, and predict success in solving problems, 
as we noted in the preceding chapter. 

Several attempts have been made to analyze such reasoning tests, but the 
results vary, depending on the kind of analysis, the tests, and the range of 
ability in the sample of subjects. The conclusion reached by Guilford, Kett- 
ner, and Christensen (1956) after a careful factor analysis may be taken as 
representing the results of one sophisticated attempt. They say that general 
reasoning is not general intelligence, though it is undoubtedly one of the 
important intellectual abilities. It is not a broad ability to manipulate sym- 
bols, but there seems to be a narrow ability of this sort. It is not a generic 
ability to solve problems, though it undoubtedly plays a role in solving prob- 


lems of certain types. It is neither the familiar deductive reasoning nor arith- 


metic reasoning. “By elimination and by consistent indications of a positive 
: ng has something to do 


nature, the best we can say is that general reasoni 
with comprehending or structuring problems of certain kinds in preparation 
for solving them. . . . It may be a general ability to formulate complex con- 
ceptions of many kinds [pp- 171-172].” Performance on reasoning tests can 
also be interpreted as convergent production, in line with Guilford’s (1967) 
general treatment of the nature of intelligence. 
Attempts to separate inductive reasoning ability from deductive reasoning 
ability by factor analysis have not yielded convincing results; Guilford (1967) 
does not include this distinction în his structure of intellect. On the other 
hand, when students in fifth and sixth grades were taught elementary set 
concepts by inductive or deductive procedures, tests of inductive and deduc- 
tive abilities did vield differential predictions of criterion performance (King, 
Robert, and Kropp, 1969). This is one teaching experiment—there are not 
many—in which an interaction between aptitude test scores and teaching 


methods was demonstrated. 


The Development of Reasoning 
how does it develop? 


ated accomplishment, I 
a hildren of diferent ages, Burt (1919) 


d by 40 to 60 percent of the 
levels. Two of these follow, 
-year level. 


If reasoning is such a sophistic 
After collecting considerable data on cl 
selected reasoning problems which were passe 
children of each age and arranged them ìn age leve 
the first at the seven-year level, the second at the nine 


All wall flowers have four petals; this flow 
Is it a wall flower? 

Three boys are sitting in a T 
George is to the left of Harry- 


rm of the pr 


er has three petals. 
ow. Harry js to the left of Willie; 
Which boy is in the middle [p. 73]? 


Burt al d the fo oblems and asked the children to give 
urt also varied the 
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the reasons for their answers to each. The results Jed him to the — 
that by age seven children can avoid most fallacies if obvious a aee 
fourteen they will be caught by the same fallacies if well disguised. 
formal character of the fallacy was not as important i eau 
of material. The picture presented is one of a continuous aaa na 
velopment in general breadth of comprehension and in experience ky ae dl 
important concepts. Burt believed that when the child has a mental ag 
seven, he has all the “mental mech 
development consists of an increas 
the child can perform. 

In the Paris laboratory of Binet and Simon 
was given the assignment of stand 
children. He was more eked 
by the quantitative data, so he turned to a question-and-answer procec 
less rigid but more subtle, directed toward the role 
social and intellectual progress, P 
dren's spontaneous questions in 
situation, he asked Why 
the children incomplete 


as the amount and kind 


: Bsc PCs ew Bit 
anisms essential to reasoning. vines 

i : c aan ich 
e in the complexity of the processes wh 


T i w Pig get 

at about this time, Jean H ings 

¥ p £ PA ye 

ardizing Burt’s tests for French an i 

i i ; er, thar 
intrigued by the children’s comments, however, t 


of reasoning in the les 
iaget (1926, 19 8) collected records of chi : 
fattiral situations, and, in the mise 
questions (Why does the wind blow?) and pa 
sentences to complete (The man fell off his bicye 
because... The pebble sank although... ), 
Piaget's interpretation of his m 
centrism and its cons 


aterial emphasized the young child's r, 
equences for thinking. Unlike the older child, the yang 
child of six or less does not communicate well. When a girl of oae oe 
asked: “Have you got a sister?” she replied: “Yes.” But the next qaen 
was: “And has she got a sister?” to which the reply was: “No. I am i j 
sister.” The difficulty of communication in this situation is not so much £ 
language difficulty as the difficulty of sceir 

he presence of another child may e 
take conversation because at this ; 
child's point of view 
munication. When he 
this reason the young 
function of because sp 
includes because, he m 
the young child is not 


ng herself from her sister’s poaa 
ncourage speech, but it is not age r 
age the child cannot appreciate the ui > 
and adapt his language to the requirements of vie 
makes a statement he has no need to justify it, and he 
child, before seven, seldom uses words having = 
ontaneously. When asked to finish a sentence 
isses the logical connection, This does not mean pio 
curious; it means that his search for reasons has not ye 

y satisfying answer will suffice. h 
e use of such logical onnectives as because, therefore, and althoug 
Increases with the child's 


Socialization, 
anguage molds thought, but bec 


t logical 


. . ay. jalities anc 
characterized by Juxtaposition of superficialities 


hen. 
means no more than and or and t 
his purpose as follows: 


uzanne; Edith is 
or Lili? 


Edith is fairer than S 


je 
larker than Lili. Which is the dar 
est, Edith, Suzanne, PESER At i 
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se girls in obvious classes, composed of fair girls 
or dark girls, but the problem depends on the relation darker than. The 
difficulty, according to Piaget, is that Edith can have one relation to Suzanne 
and simultancously another relation to Lili. 

As the child accommodates his playing, talking, and thinking to Suzanne’s 
point of view and also to Lili’s, and his mother’s, and his brother's, he 
moves toward thinking from everyone's point of view, or no one’s. Thus 
children’s answers to questions begin, after age eight, to look to impersonal 
principles, at first quite limited and concrete, for explanation. This develop- 
ment requires the accumulation of experience with specific objects and situa- 
tions, and facility with certain concepts, as well as language skill. As we 
difficulty of disjunctive concepts, the use of 
a rule, occurs later than the 
ngsters, having reached the 


The young child can put the 


would expect, knowing the 
although to express discordance or exception to 
use of because. After fourteen or so most you 
s, know about general principles and try to explain 
an event by stating one. They are also competent, at minimal levels, in 
If... then arguments. Thus reasoning, which started as a social technique, 
ration directed toward abstract generalizations. 

nts of reasoning are at variance with 
rpretation. The training study by 


stage of formal operation 


becomes an impersonal ope 
Some other developmental treatme 
the Geneva account, cither in data or inte 


Anderson (1965), as noted earlier, demonstrated that children of ten could be 
a generalization that Inhelder and Piaget 


trained to use a strategy for finding 
58) reached a similar conclusion from 


restrict to age fourteen. Wheeler (19 
factor analysis of reasoning tests of the Burt type given to children of differ- 
ent ages. Her data showed that children are capable of logical reasoning at 
a much earlier age than usually assumed and that there was a steady im- 
provement throughout the school period. She also found, like Burt, that the 
calculus of classes was achieved before the calculus of relations and proposi- 

hat operations with propositions 


tions. The British interpretation assumes t 
not because a transition from one 


come later than operations with classes, 
because there are more complex 


stage to another is involved, but simply 
operations to be performed. 

Clark (1969a) has an alternative interpretation for 
success on the three-term series problem. Instead of emphasizing the differ- 


ences between classes and relations, he asserts that children develop the 
semantically prior nominal sense of adjectives before the contrastive senses. 
r Betty is older than May to 


That is why a young child can take the sentence 5 
mean that Betty A younger than May. Both young and old are interpreted in 
a nominal sense, so that both sentences can mean Betty is different in age 

k with the reasoning 


from May. This interpretation follows from Clark’s wor 
of college students, but it is not consistent with the usual assumption that 


children acquire contrastive adjectives, like old and young, before they acquire 


the development of 


the c * . . 
he continuous dimension. 
If the development of reasoning appe 


explained, it should be noted that there 
the assumption that this development depends on the socialization of think- 


ing and especially on moving from one’s own point of view to that of another 


ars to be incompletely described and 
seems to be no disagreement with 
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and to that of everyone. The linguistic forms provided. by the foie 
play a larger role in this development (see Chapter 3) ates tne 3 na 
school allows, however. As to the question of stages, since erin in ore 
operations with propositions (which are more complex than re on 
hypotheses about discontinuities in the development of reasoning w 

into methodological difficulties at least as serious 
(see Chapter 2). 3 si 

Certainly, much will be learned as the carefully written problems, 

terminology, and the research designs of 
chapter are applied to development 
that the growth of the child's cap ee) 
is a gradual, quantitative development; and, if we assume that the a 
ing of the information in a problem takes place within the limits of shoi 7 
term memory (Miller, 1962), that the upper limit is continuously rising 
during the growth period. Thus, at a certain age a child may be able fa 
perform operations involving classes but not operations involving propositions, 
simply because the complexity of the | 
capacity for handling information. By 1 ea 
involve transformation of a Proposition—for example, from Edith is faire 


5 an 3 sen later: 
than Suzanne to Suzanne is darker than Edith—will be solved even lat 
We also know, however, th 


at specific concepts become easier to manipulate 
as they are learned more thoroughly, or overlearned. The meaning of nepr 
tive terms and the procedure of falsification are again rather specific acquis! 
tions. So it appears probable that a gener 


as those involving classe 


the experiments described in thi 

: ; aerate 
al questions. Probably all would agree 
acity for immediate or short-term memory 


cenit of DE 

atter approaches the limit of hat 
: sms tha 

the same a sumptions, problems the 


` : vapacity 
al continuous growth in capa k: 

i i i i sith discon- 

for memory, retrieval, and information processing goes along with dis 


‘ : a é Ss i ‘ances! Ò 
tinuous Jumps in specific acquisitions, depending on the contingentie 
schooling and experience. 


Major Trends and Implications 


Logic is formal and timeless, but reasoni 


that 
Proceeds step by step in measurable 


ng is a human enterpris¢ ing 
eas tape ; sasoning, 

periods of time. Deductive reason ia 
, 7 ’ cir : bleme Ge 
specifically, was considered in this chapter as the solution of problems 


. P . i Je “a 
cerned with the validity of inferences. These are not problems of invent ts 
we à i r D 
or productive thought, for alternative conclusions to deductive argume 
are usually obvious, They 


: sans accord 
š are problems of evaluation of conclusions ac hat 
ing to the criteria of formal validity. Hence the productive nt t 
CUTS Ci i 4 N iy i e 3 a 
ae consists of reorganization of the structure of the argument to faci o! 
evaliatiön: öf the alternative conclusions. The purely formal nature sar 
logical validity is a cultiva at evolved historically through ¢° 


ated notion th ion 
turies of debate and is achieved by cach individual through social interact 
and education, 


Deductive reasonin 
but the reasoning 
logic problems, es 


8 enters into the 
Processes are 


Pecially thos 


; N soblems: 
solution of all kinds of probl : 
rai ; Š a 5 on 

exhibited most prominently in the soluti alè 
€ systematically constructed by the manip 
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tion of content or structural variables. Experimental investigations have 
varied the content from purely symbolic, content-free arguments written in 
X, Y, and Z to arguments of the same structure written in continuous, mean- 
ingful sentences. The content of the meaningful arguments has varied from 
neutral to controversial and from patently true statements to patently false. 
The logical structures that have been studied most thoroughly are syllogisms, 
three-term series, and conditional statements. 

Thus, in the 1940s, research focused on man’s irrationality was able, by 
manipulation of the content of syllogisms, to obtain objective evidence of 
the biases that psychologists observed in their neighbors. Later, when cogni- 
r fashionable, more comprehensive research designs 
cts of the cognitive consistency of an argument 
as well as the emotional bias toward the content. When the content of the 
premises makes them clearly false or inconsistent, reasoning is slowed but, 
under certain conditions, ig more accurate. There is some evidence from 
experiments on attitude change that “covert deductive reasoning” accounts 
for the acceptance of conclusions not explicitly mentioned in the message. 

Even an unbiased thinker confronts many obstacles as he works his way 
through a logical discussion to a valid inference. Most of the terms that 
appear in a logical statement—such as some, is, or, If . . . then, and noi— 
he context of ordinary language. The logicians 
prune away the lively meanings, leaving the idealized structure, ts 
ordinary person retains the conversational connotations. Errors related to 
the logical structure of syllogisms and identified in logical terms were om 
a psychological explanation in terms o ts in the 1930s an 
in terms of invalid conversion in the 1950s. In the 1960s, differential paang 
reduced atmosphere errors but not conversion errors, and a broader interpre- 
tation of atmosphere effects encompassed the most frequent error ena 
leaving the error of invalid conversion with an ambiguous status. At ns 
same time, variations in the diffculty of certain syllogistic figures, i cee 
lumped together, were interpreted according to principles derived from 
paired-associate learning. 

Unlike syllogisms, three-term series problems are solved b = gl 
few seconds, but even so variations in time and errors throw some 8% 


i e spatially rep- 
the reasoning cesses. The hy vothesis that the terms ar y re 
pee be oe fc ly attractive, and objective 


resented in visual imagery has long been intuitive aay oe 
data in agreement with this hypothesis have recently been mg = 
problems that are easily visualized are more easily solved than t m a 
are not. Problems stated so that one premise must be rearranged to be 
aligned with the representation of the other in cognitive ao aeie 
dificult. But much of the empirical evidence on variations in di en ty be- 
tween problems can also be explained by assuming that the = premise 
is easier to line up with the first if the initial term of the secon premise is 
the subject of the sentence. And much of the evidence can also be explained 
by linguistic principles, particularly by assuming that the nominative sense 
of common adjectives is more readily processed than the contrastive senses. 


tive theorizing became 
were able to quantify the effe 


have previously appeared in tł 


f atmosphere effec 


y adults in a 
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Bes ie ete Asan ce stan 
Hence, this is one example of how psycholinguistics is beginning t 
ence, this Xa 1 
tribute to progress in general psychology. 
The plausibility of each of the hypothes are 
ictions "ach s S a ‘re are several sources o 
successful predictions of cach suggest that there are seve 
culty and that no single hypothesis will suffic 


and the 


for three term series 


- The agreement of one hod 
esis with the data on time and errors does not dispose of the others. Further 
more, if these diverse and subtle hypotheses are “apa 
little reasoning tasks, we should not expect simple explanations a 
complex problems. The hypotheses offered for syllogisms and for t a oer 
series problems have surprisingly little in common, but this might hs Ki 
from the research interests of psychologists rather than from complete diş 
parity among the reasoning processes. vpe : i n 
In deductive as in inductive reasoning, negative information causes me i 
trouble than positive because an additional transformation is required anc 
because the word not has avoid 
cance. Furthermore, the 
exception to a stateme 
discourse, which m 


plausible for such modest 


ant connotations beyond its logical signifi- 
logical significance of 
nt, depends on the 
any have been t f 
Inferences from conditional statements are also difficult, partly because 0} 
their hypothetical nature, partly because state 

have logical implications, partly be 


i rof an 
a negative statement, or of i 
Min TES 
assumption of a closed universe 
aught to question. 


ments that appear rien 
cause of the pervasive awkwardness r 
negative information, but above all because reasoning with puns s 
depends largely on falsification of the consequent, which runs counter to the 
normal propensity to describe things as they are, 
Research on deductive reasoning converges with 
ductive reasoning, especially on attribute and rule 
psychological difficulty, as well as the logical cfficiency, of falsifying a state- 
ment. The difficulty of negative information also applies generally, and so 
does the difficulty of switching from ordin 
But, aside from these few Seneralities, 
reasoning are specific to the 
Education 
disciplines, 


in- 
recent research on ; 
4 asize the 
learning, to emphasize t 


ary discourse to formal reasoning. 
Most of the difficulties of deductive 
Various logical forms. 

in logical reasoning resembles ec 
Beginning with the general | 
Special aims and limitations of formal re 


common fallacies, and methods of verifying inferences. He practices logical 
manipulations, logical analysis of ordinary statements and alternative ae 
tems for representing relations between statements, Methods of “programmet 
reasoning” have been devised that permit the thinke 

dictions in his own inferences. Such t 
amounts, when evaluated by 
cause most people do 7 
some people do app: 
comparisons there 


5 A systematic 
ducation in other cient 
anguage, the student learns s 

Š P structures, 
“soning, standard logical struct 


r to observe the meant 
raining is quite effective, even in oe 
performance on reasoning tests, probably ne 
not get such Practice in their routine activities. 
arently teach themse 


sa > in all 
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A á as ú > average 
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We have seen that cone 


j . a z a op ine 
ept learning, information seeking, and othe 
ch 
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materials are presented and as the subjects acquire skill in converting inter- 
esting content into content-free, formal statements. But this does not mean 
that content is unimportant or that the results thus obtained can be directly 
applied to real-life activities with meaningful content. Transfer from training 
in formal logic to reasoning about events headlined in the evening paper has 
typically been minimal. Yet it is possible to teach any topic with an emphasis 
on reasoning, or critical thinking as it is often called, rather than on the 
accumulation of facts. The reasoning tests constructed for factor analyses of 
reasoning supply models for tests of achievement in academic subjects. 
Children acquire simple reasoning skills in the normal course of develop- 
ment, but whether this development occurs gradually or by stages is a matter 
of controversy. Age differences are obvious, but it is questionable whether 
they are peculiar to logical operations or to general intellectual growth, 
whether they can only be acquired according to nature’s plan or can be ad- 
vanced by special schooling. It is not inconsistent to expect a continuous 


growth in the basic intellectual functions along with discontinuities resulting 


from experience with specific logical structures. 


CHAPTER 7 


PRODUCTIVE 
THINKING 


Creative Thinking 
Creativity Tests 
Predicting Creative Achievement 


Favorable Environments for Creative Work 
Processes and Conditions 
of Productive Thinking 


Training for Productive Thinking 
Production Controls 
Major Trends and Implications 


nlike the problems dealt with in previous 
chapters, there are also problems with solu- 
tions that cannot easily be categorized as 
successes and failures but rather are evalu 
inuous dimensions, such as appropriateness 
and originality. Productive thinking, conceived as con- 
structing, writing, or otherwise producing solutions, m 
be contrasted with finding the correct solution or reach 
ing the goal. The distinction drawn by Guilford (1956) 


: d A he 
etween convergent thinking, which zeros in on n 
F q ee i ; out- 
right answer, and divergent thinking, which moves 0 ; 
ward, is now 


generally recognized. The distinction 5 
blurred, to be sure, because traditional problem-solving 
experiments have often reported solutions that did no 
fall neatly into either success or failure ea 
inking is not well described by calling ' 
creative, divergent, Productive, original, or imaginative 
all these refer to intellectual activities with certain dis 
tinctive characteristics. ; 

In this division of the psychology of problem solvi: 
the defining characteristics reside in the solutions rathe 
than in the processes involved. Whereas in previous chap 
ters the correct solution was easily differentiated from 
272 
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the others, in the present chapter this will not be so easy. Here we must con- 
sider problems each of which has multiple solutions ‘that vary on several 
dimensions, Judgment of the solution plays an important part in the experi- 
mental methodology, and, if the thinker produces more than one solution 
judgment is part of his problem also. The most interesting solutions psycho- 
logically, and the most influential historically, are those that are judged crea- 
tive. In the previous chapter, the cold formal reasoning of the losician was 
the ideal; in the present chapter, it is the ardent thought of the creative genius, 


Creative Thinking 
The most dependable criterion of creative accomplishment is the judgment 
of history, The names of Beethoven, El Greco, Goethe, Einstein, and Freud 


are celebrated in the arts and sciences, not for feats of memory, prolonged 
gnments, but for out- 


errorless performance, nor efficiency on routine 
standing creative contributions that exemplify man at his intellectual best. 
In attracting disciples and making intellectual history, creative geniuses have 
also raised questions that mark out an area of sociological and psychological 
investigation. 

Creative thinking of such a high order is not common. Even the geniuses 
of history reached their creative heights rarely and spent much of their time 
claborating the brilliant ideas that came in their most creative moments. Most 
of them made a few mistakes, in their less creative moments, that embarrass 
their disciples and their biographers. But reknown as a creative genius is 
built on one or more outstanding achievements, not on freedom from mistakes, 
nor on average level of performance. 

Any intellectual achievement that is recorded in the p 
have been the joint consequence of a creative individual 
ditions in the intellectual environment. In some instances, 
biographers have been able to describe the intellectual conditions in an 
or science that gave a favorable welcome to a particular innovation. They tell 
us that the general illiteracy of the medieval ages encouraged the develop- 
and sculpture which tells a story. We know, for 
liefs that ornament medieval cathedrals, 


admired and imitated today, were put there to illustrate some now obscure 
conflict between clergy and laity. Students of the history of psychology usually 
try to describe these factors of the Zeitgeist that stimulated new developments, 
and the contemporary sociology of invention and innovation is a necessary 
complement to any account, like the present one, that emphasizes psycho- 


ages of history must 
and favorable con- 
historians and 
art 


ment of realistic painting 
example, that many of the bas-re 


logical factors. 

There is a large literature (Ghiselin, 1952; Koestler, 1964) of personal 
testimony about the conditions that influence creative output. This introspec- 
tive literature supplies background information and suggests some intriguing 
hypotheses, but the psychology of thinking has not lacked for intriguing hy- 
potheses as much as for objective data. If one approaches these ruminative 
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accounts inductively, one is impressed by the frequency with which ~~ 
thinkers, at an honorary banquet or the unveiling of a portrait, pale 
themselves with other great thinkers. Helmholtz recognized his own thoug ht 
processes in the reports of Goethe and Gauss, and many scientists hay & 
voked the image of Archimedes. Fascinating as these anecdotes may be, a 
cially to readers who also can recognize similarities to the workings of per 
minds in their own thinking, the difficulties with such evidence are oby ce 

There are, however, some solid biographical data on creative individuals o 
the past, so we shall begin with some individual differences. Previous chapters 
opened with the conditions that facilitate i eH 
to individual differences secondarily. But in dealing with creative thinking, 
information about creative individuals, identified by the criteria of outstand- 
ing accomplishment, is more definitiv 
than information about conditions 
great thinkers is caused partly 
sonalities—it is true 


and retard performance and came 


e, and a more secure starting point 
or processes. The preoccupation with 
by an intrinsic fascination with them as per 
» as far as such people are concerned, that there are no 
dull biographies—and partly by the hope of finding some generally useful 
information about the personal characteristics and environmental conditions 
that promote creative thinking. 


THE GENIUSES OF HISTORY 


The creative geniuses that we 
predominantly from middle- 


ioe ry age 
q : © we can divide mental age by chronological ag 
and get an IQ estimate of 200 for young John. A mental age of about oe 
‘ à er 
earned algebra at eight, so anot 


m 
an ase there are several such facts that Po 
= E the average of which is 190 for his childhood and 170 for hi 
Josi 5 The records of 300 eminent men and women have been evaluate 
in this way by 


; pane — for 
t three Psychologists working independently, and in cases 
which the records were ad 


ge 
i equate and the psychologists agreed, the some 
estimate can be taken as the minimum IQ that will account for the recorded 
achievements (Terman, 19 


26). 
Estimates of intelli 


Sence for those 
IQs. Goethe, who w 


igh 
Tote poetry in L 


with adequate records suggest very sg 
atin when he was eight, was scored 
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for his childhood and 200 for his youth. Others of interest are Pascal, the 
mathematician, 180; Voltaire, the French writer, 175; Mozart, the composer, 
155. In general, those who made their reputations as philosophers averaged 
170, the poets, novelists, and dramatists 160, and the scientists 155. > 
The other important fact that has turned up in the old records is that 
these eminent thinkers did their most distinguished work when they were 
young. There are exceptions, like William James, but a study of many 
histories of art, science, architecture, medicine, and other specialties by 
Lehman (1953) demonstrated that the age of creative achievement is lower 
than most people suspect (see Figs. 7.1 and 7.2). A study of the hundred 
most important inventions since 1800 (Alexander, 1945) disclosed that the 
average age of the inventors at the time was thirty-seven. This type of analysis 
‘eral reasons, but Lehman's generalization about 


has been criticized for sev 
high quality seems to hold up on further 


creative accomplishments of very 
examination (Botwinick, 1967). The grey-bearded gentlemen in the fading 
portraits on museum walls were not painted while they were achieving fame 
but after they had it made. 

The comparison between young 
figures indicate because the young men 
when they were burdened with occupational and domestic problems. Eminent 
thinkers of fifty have arrived. They are generally established—with facilities, 
f away many difficulties. These same 


and old is more unbalanced than the cold 
usually created their masterpieces 


assistants, and the prestige that smooths 
thinkers at thirty had to find and hold jobs, perhaps to move several times, 


and to compete with their peers for support and a place to work. Most had 
wives to entertain and children who wanted to play with them. But the 
abundant energy of the young and their enthusiasm for new ideas enabled 
some of them to surmount the typical hurdles of this stage of the life cycle. 


Their intellectual capacities show up, moreover, in learning as well as think- 
ist, for example, will often absorb and synthesize 


atible to older scientists. Such open-minded learn 
ater when his ideas are well organized into 
ideas. 


ing. An eager young scient 
two ideas that seem incomp: 


ing will be harder for him 1 
Cognitive svstems that resist penetration by new 
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Age 
t. Study of 60 histories of art disclosed that 
ce. More of these masterpieces were painted 
1 any other 5-year interval, but an 


-five and after seventy. From 


FIGURE 7.1, Age and creative achievemen! 
650 oil paintings ‘were mentioned at least t° 
when the artist was thirty-five and forty than i 
appreciable number were painted before twenty 
Lehman (1942). 
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FIGURE 7.2. Quantity and quality of creative achievement at different ages. The solid 
line shows the ages at which 182 famous philosophers first published their best eg a 
i.e., the one most frequently mentioned in the histories of philosophy. The pkey 5 
was constructed by tabulating the ages at which all the philosophical works of these 182 
philosophers were first published. From Lehman and Gamertsfelder (1942). 


When we ask about the personalities of great thinkers, the evidence F 
limited, but something can be learned from careful study of their lives ane 
works. R. B. Cattell (1963), who has analyzed much of the variance of per- 
sonality into 16 standard factors, read m 
of scientists and inventors to work out 
these standard factors. They were much 


course, but there were some interesting differences, On the average, ha 
scientists were schizothymic, that is, withdrawn, skeptical, internally pes 
occupied, precise, and critical. Despite this internal preoccupation, they n 
frequently dominant men, especially on intellectual matters. When Pasteu 
was director of a laboratory, he was much criticized for his authoritarian 
methods. The great scientists of the past, according to Cattell, were desurgent. 


a i 2 x S D incom- 
that is, introspective, restrained, brooding, and solemn, but this is not ince 
patible with a high level of resourcefulness 


As to the old question of genius 


cate any special relation; in fact, his estimates gave the scientists and inve Sat 
z oj a 
more ego strength and emotional stability than the average. But simi 


studies (Juda, 1953; Raskin, 1936; White, 1930) are consistent in uncovering 
more signs of psycho than 


among scientists, 


i i i ies 
any biographies and autobiographi 
f li ipti am on 
a generalized description of them : 
i 4 <¢ respects; o 
like other people in most respects 


and adaptability. indi- 
t id x 

and madness, Cattell’s study did not Y 

ntors 


pathology among artistic and literary geniuses 


DIMENSIONS OF CREATIVE ACHIEVEMENT 
The concept of creative achiey 


: ic, <i f great 
i ement comes from the idealization of $ 
men and their works. Afte 


ns 


à à ; igation begi 
r their praises have been sung, investigation DCS at 


i identificar: 4 3 th 
with the identification of a few dimensions that embody these ideals s° i: 
we can set up criteria for evaluating contemporary achievements. The 


2 à 3 > the ones 
mensions that psy chologists apply to creative achievement today are the 
that others, as far back as Plutarch 


` - are more 
i > applied, but the present methods are 
systematic. 


INTELLECTUAL L 


EADERSHIP 
Many of the great 


thinkers of the 


y for 
their own creative output, but for thei 


nl 
past are honored today, not en 5 
r influence on other thinkers. 
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new method of analysis, a promising theory, or an exciting art form inspires 
some bright young men and women of the next generation to continue this 
work, a trend has started. A master has found disciples. The creative scientist 
may sit back and wait for the judgment of history, but more often he tries to 
convince his peers of the value of his innovations. The radical philosopher, 
composer, or film maker who succeeds in cultivating an audience for his 
message is the one who is likely to acquire a reputation. 

Investigation of this dimension of creative achievement must wait, of 
course, for the trend to take shape. Critics, reviewers, and other judges may 
attempt to predict the life span of the intellectual flurries set in motion by the 


novel ideas of the present decade, but most do so with great caution. Style 
g, can easily confound the evaluation. Objec- 


of communication, for one thin 
ver, as soon as one person’s work is cited 


tive data begin to accumulate, howe 
or imitated by someone else. 


SENSITIVITY TO PROBLEMS . R 
A reputation for creative thinking can be made, at least in part, by asking 


questions, by identifying problems, and thus opening up a new area of 
thought. The creative thinker is a problem finder as well as a problem solver 
—he often perceives a challenging inconsistency where others perceive a 
satisfying order. On this dimension, intellectual leadership is much like 
political leadership: one has a fortunate start if the problems that challenge 
him and arouse his creative efforts are latent problems that arouse others 


also as soon as they are made explicit. 


ORIGINALITY . , . f , 
This dimension is often included in instructions to judges of creative 


achievements. An original solution to a problem is a novel one, not a copy 
of some other solution. In principle, this is an unambiguous dimension, but 
judges need experience with the other solutions in order to evaluate origi- 
nality. Arguments about originality have often followed applications for 


patents. 


INGENUITY 3 , iie buit 
i Ingenuity is often included or implied in the discussion of creativity, T 
it has been explicitly described by Flanagan (1963) as the invention or cis- 
covery of a solution to a problem in an unusually neat or surprising e 
The solution should be practically useful, more than just Don a A 
ing to Flanagan. It should be clever, not a solution that cou e arrived at 
by a logical routine. In the research context, cleverness is a synonym for 


Ingenuity. 


USEFULNESS be i 
The practicality of a solution to a problem can be ignored for some 
Purposes, as in pure science and fne art, but in applied research organizations 
it may be the most important dimension. Usefulness is not always obvious; 
expert judgment may be necessary here also. 
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oH afi n. 
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ayperience 
n contrast novels that sweep wide over aa ee ies 
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When En a i tests were first given to mental hospital a 
was noticed that the patients gave more unusual responses than To iat 
was the incentive for compilation of frequency tables, which we oog 
table is a more common response to chair and therefore peme T ponies 
normal than octopus. Poets, for example, avoid the old clichés, v. One 
when saying something said before, try to say it in an e i ‘capa 
psychologist, having administered a word association test to / a those 
reported that she gave “a higher proportion of unique responses ee 
of any one outside a mental institution [p. 491]” (MacKinnon, 19 se é (Ai 

Early studies of imagination collected work samples from children saše 
drews, 1930) and adults ÇI largreaves, 1927) and scored them as E a 
of frequency, and this procedure is still used today. It can be done oo epee 
with the aid of frequency tables, or as an impressionistic rating by © 
enced judges, 
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or two of those thought of as eminent today may be called creative geniuses 
by historians of the next century. 

Although discussions of creativity are usually focused on high-level achieve- 
ments, and the dimensions listed above are most clearly defined in reference to 
the high end of each, it is reasonable to assume that the dimensions are con- 
tinuous, and that all products and all individuals can be located somewhere 
along them. Thus, it is true that everyone is creative, but the truism does not 
flatter everyone. Everyone has some creativity just as everyone has some 
height and some intelligence; high, medium, or low. 


CRITERIA OF CREATIVE ACHIEVEMENT 

hinking requires a group of creative individuals, 
for contrast with other groups which are com- 
To identify the creative individuals, 
judgments, and work 


The study of creative t 
called a criterion group, 
parable in some respects but not creative. 
psychologists have collected product records, peer 
samples. 


PRODUCT RECORDS Rs ord: patents: oi ted 
Some creative achievements are matters of public record: patents granted, 


books published, paintings hung, musical compositions — the 
like. These products have been evaluated by apeeielieis, ma ke er FA 
sions they consider important, in competition with other products of the 


i inc i echani arts and sciences—the 
same kind, following the normal mechanisms of the arts ane me 
aluated Beethoven and Einstein and differen- 


tiated them from Jones and Smith. Anyone may gpa with: ane ia 
of editors and official commitees, but when large numbers are “i sig 
most will agree that the products accepted are mor a e tt 
than those rejected. The painting that remains covered in t ae ath 
and the manuscript that lies in the writer's bottom drawer sai $ nothing 
to the arts and sciences nor to the psychology of eel? oe at oe 
Fortunately for present purposes, psychologists eT eae, 
Spective and quantitative. A. survey of the prod ie k F087) Tius. 
conducted for the American Psycholog ation C jes j li i d 
‘in product records. The ie 2 ea 
to those members of the association who received bat rae Pa 
1930 and 1944. From these names, a list of 150 hig a a 
tained simply by counting publications noted in the Psyc gi s S, 


: H baii f interest 

a fairly comp ecord of publications © shane: 3 

names y a ne | to make Pist of 624 “highly visible” psychologists, and 
e = vere added al a 


age f 5 

Wace < ig genera i ; 
later a group of 601 randomly selected ers he gender ston 
cluded foe comparison. The publication records of these 1,225 psychologists 


ene EAI 
over a year period are shown in Fig. 7.9. r n 
four-year period a ibution of production records for 


The most obvious fact about the distri : A 
a Enr enai few contributed 40 or more publica- 
Psychology is that it is highly skew ed. A F 


tions durinø this time, while most contributed few if any. The picture would 
be: the ea ra ilies ets and sciences if comparable data were plotted. 


same social mechanisms that ev 
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FIGURE 7.3. Publication records over 


a four-year interval of 1,225 i 
© Se 8o To sa io ara who received doaia a 
Number of publications, degrees between 1 930 and 1944. 
1950-1953 from Clark (1957). 


PEER JUDGMENTS 

It is a common assumption that a work of i ks unléss 
but none of the geniuses of history got his name in the history books 
Someone put it there. A man’s reput: 
on a series of social evalu 
writer se 
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ation for creative accomplishment vee 
ations, first by his peers, later by his et ee 
nds a poem or a short Story to an editor, and the editor i ot 
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The use of peer judgments to assemble a criterion group hog om 
by MacKinnon’s (1962) Study of architects. He began by aking T 
fessors of architecture, i independently, to nominate the 4 ot 
creative architects in th States. If the nominators did not agre 


ra ted 


working 


e United 
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all, 200 names would have been submitted, but there were only 86. Of 
these 86, 13 were nominated by all 5 professors, and 40 were single nomina- 
tions each proposed by one member of the panel. Later, MacKinnon asked 
11 editors of the major American architectural journals to rate the creativity 
of the 64 architects invited to participate in the study. Still later, he asked 
the 40 who accepted the invitation to rate these 64 and found that the archi- 
tects’ ratings and the editors’ ratings were in close agreement, as indicated 
by a correlation of .88. A group of comparable, but less creative, architects 
was also assembled for contrast. One of the determinants of these peer judg- 
ments is suggested by the fact that during the 8-year period under considera- 
tion 97 articles were written by or about the creative architects, and only 
two by or about the others. 

It is not the purpose of this procedure to obtain a rating that will be fair 
to cach individual; that must wait for his obituary. The purpose is to develop 
a criterion group for research purposes, and a small amount of error can be 
tolerated, as in any research. A history of architecture written 50 years from 
ative accomplishment from 40 architects selected 


now will record more cr 
as creative by their peers today than from a control group, even though the 
reputations of a few individuals may shift up or down. 

Systematic errors are more serious. These occur when the judges nominate 
individuals who conform to a common stereotype of the creative thinker, 
or those who belong toa temporarily popular local movement or in-group. 


Established men of prestige may be preferred over younger men, or vice 


m nt a certain trend or school may be systemati- 


» or products that represe 
cally preferred over others. 


Some specific dat: he 
specific data on the i : res wy Ts 
the survey of America’s psychologists seca. steep patil 197A Lis 


of the names of 624 “highly visible” psychologists were sent to all persons 
a t with instructions to vote for the 25 who 


contributions to psychology as a science.” 


determinants of peer judgments emerge from 


Whose names appeared on them, 


had made “the most significant > i! sf 
he number of votes that an individual received was related, as one might 


expect, to the number of publications produced; the correlation was 43. But 
Publication is only the first step up the ladder; the nextistepils Lecogmikion by 
others. Those who deplore emphasis on quantity of publications will be glad 
to know that quantity was not enough. The number of times that a psycholo- 


+ ga i i rrelated only .36 with the 
Sists’ work was cited by another psychologist co nE. 
: the number of votes received correlated .67 


number of publications, whereas : x 
l i others. Evidently quality, as reflected in rec- 


with the number of citations by i th h 
Ognition by one’s peers, was considered more important here than quan- 


tity, 

‘An inden TER E EN advancement—number of offices held—was also 
included. Pisno to an APA office is based on diverse considerations, such 
as administrative competence and social sees 2° ey a PN 
and professional organizations. This index correlate = ro ar num- 

er of publications during the four-year interval hut a cs San Te 
ber of Gitions bydihërs: Apparently, the characteristics of an individual that 
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favor election to office overlap those that favor nomination for i 
significant contributions to psychology as a science, because the ns a 
between number of offices held and number of votes received was 4. ca 
The close relation between scientific reputation and citations by ot si 
was revealed again in a more recent analysis of psychologists ny 
(Meyers, 1970). A correlation of .68 between a citation index m a 
by peers was found, and a check of names of psychologists who had ; rae% 
medals, awards, and other tokens of scientific eminence in the past few yea 
disclosed that practically all had very high citation counts. This close relation 
is direct, in that scientists cite publications they h 
consider important, and indirect, in that they te 
have been frequently cited by others 
names they see and hear frequently 


ave personally read and 
nd to cite publications that 
and to give high ratings to those whos 
Cintellectual leadership). 

WORK SAMPLES 

Some of the difficulties of other criteria can be 


cx-alt wai 
avoided by the use of wor 
samples, intended to be represent 


ative of the work under consideration. an 
applicant for a typing job, for example, may be asked to type a letter since 
that is typical of the regular work to be done. 
samples should be clarified bi 
When we use a work sample 


es f work 
The logical status of wor 
rposes. 
ecause they are also used for other purpes 
ee E; Pear soretica 
as a criterion, we are not aiming at a theoretica 
lysis of perfi icti A se. We are 
analysis of performance, nor a prediction for future performance. k 
simply evaluating present performance on a sample of the regular wo 
. è ain 
required. As to creative work, we can ask people to write a short story, pai! 
a picture, put sticks and blocks toge! | 
addiy 
tivity can be manifested on signment, but appears most en 
jtuation ez e ete: ae i ‘ ssible 

when the situation calls for originality and the problem allows many po 

solutions. 


; "Mk: rea- 
ther, or design a research project. C 
almost any as 


work sample as 


Pa z investigator 
a criterion is that the investig 
roduction; 


variations in opportunities and orng 
. The work sample is not entirely typical, hawer 

time, and persistence of involvement in ie 
stressed in many biographies of creative thin wi 
f creative achievement is not to be expectec 
lated, of course, like other complex proar 
mav appear. When students write short = is 
Projects in an hour on request, the range of originality 1S 


M pi F 2 aa 
wide, and two judges with some experience can rate the products for 
with good agreement. Perform 


eate! 
fae ance on such assignments can then be trea e 
as a criterion of creative achievement against which other measures can 
compared. 

RELATIONS BETWEEN CRITERIA 


E ITE oat i died 
l ac complexity of the criterion problem emerges when it is — ich 
e i a i i ` 
seriously. It is a general truth of intellectual progress that anything “ 


Ic i 5 er: an 
voks simple at first glance becomes more complex on close Seat aa 
the criterion of creativity is no exception. Taylor, Smith, and Ghiselin (19° 
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collected data on 56 variables characteristic of 166 scientists—mostly physi- 
cists, mathematicians, electronic engineers, and chemists—in a large labora- 
tory at a government basic research center. The variables included research 
products; ratings made by laboratory chiefs, immediate supervisors, and peers 
on the same level in the organization; and indices of organizational status. No 
unitary criterion of scientific creativity resulted from any type of analysis, but 
several interesting factors emerged from factor analysis. 

One factor represented originality of work and thought, carried through to 
the stage where it appeared in patents and in research reports, judged by 
expert scientists in the same field to be original and significant. Another 
factor, coming from ratings by laboratory chiefs, represented the contribution 
of each scientist as viewed from higher levels in the organization, And a 
third factor, coming from ratings by supervisors, represented effectiveness in 
meeting immediate goals of a lower level. The negative loading of number 
of publications on this third factor illustrates the conflicts that a scientist 


faces as he tries to satisfy the supervisory level of the organization and, at 


the same time, to make a contribution to the scientific community outside 


the organization. 

The criterion arrived at in the survey 
Was more unitary because the demands and conflicts v e labor 
tory did not enter into it. Over half the psychologists worked in universities 
and colleges, but data on effectiveness in meeting the immediate goals of 
the local institution were not collected. No one asked whether the psycholo- 
gists were good teachers or good committeemen or whether they om in 
their grades on time, Judgments by peers at other er are based on 
evidence that goes beyond the intramural demands and pro ably represent a 
cleaner criterion of creative accomplishment. But even in the data on psy- 
chologists, there was a suspicion of tw n that the number of offices 


o factors i 
held showed only a low correlation with publication records. Psychologists 
Play dual roles as professional persons 17 the 


Association and as research 
evn A i ), Similar low correlations between 
sb according to Clark Le : 
Publicati m ae Anas sefulne: 
Publication records and ratings of u 


ss to the organization have been 
reported by Pelz and Andrews (1966) from 


of psychologists summarized above 
vithin a large labora- 


a large-scale survey of scientists 


in diverse organizations 
s anizations. 
2 adv york s; AS a 
Unfortunately, despite the apparent advantages of a et sangi as a 
criterion, there seems to be no evidence on the relation between this criterion 
: PAPS be asked to sit down and plan 


: an 
and the others mentioned above. Students : iginal psychological experi 
the landscaping for a new building or design an origina! ps} gical experi- 

g fora 


Ment, but established artists and scientists are usually too busy to undertake 


Such things se 
It is aseli] at this point to compare the treatment of the criterion question 
k P . . s . 
in the case of creativity with that of intelligence. Many intelligence tests 
s a 2 ‘ 
Were tried out in the early days of research, but the one devised by Binet was 
arly day ]. Further study with intelligence 


related t i iterion, success in schoo! 
o a social criterion, succ ‘ $ t 
tests clarified the nature of intelligence, and further correlations with social 


Criteria clarified the predictive validity of intelligence tests. We now have 
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teria intelliwence teste do and 

considerable objective knowledge of what criteria ce tes ak ine 
i 2 i igence to be willing to say tha 
do not predict. We know enough about intelligence t jae =. es jab 
if success on a job does not correlate with intelligence test sco’ > et 
i i 2 ame way, we can expect tha 

does not need much intelligence. In the same way, We | “i ae Ng 
criteria of creative achievement and the creative abilities that | 
criteria will be progressively clarified. 


DISTINCTIVE CHARACTERISTICS OF CREATIVE THINKERS 


When creative thinkers are identified by the methods ag ae 
and compared with others, a fairly consistent description eka 1 A Ek 
of architects by MacKinnon (1962) disclosed that the highly nee of 
inclined to have good opinions of themselves, as evidenced by ee ee 
favorable adjectives to describe themselves. They claimed to bg Sos an 
determined, independent, individualistic, enthusiastic, and ideti alabie: 
often than the others, who described themselves as responsible, u pes i 
clear thinking, and understanding. The more creative architects—a : elle 
revealed an openness to their own feelings and emotions, a sensi os le 
and wide-ranging interests, including many which in the American ¢ aliy 
are considered feminine. In fact, scores for femininity taken from personality 
tests correlated .49 with scores for creativity. T eommies 

On a figure preference scale, the creative architects preferred I 
asymmetrical designs, which they called vital and dynamic; the EBL ai for 
with creativity ratings was .48. On a word association test, the = T 
unusualness of associations correlated .50 with creativity. Each of the cres 
group had shown an early interest in drawing 
both parents of artistic temperament 

This kind of research is difficult to 
invited agreed to participate. The mx 


and painting, and had one oF 
and considerable skill. ie 
carry out. Only 40 of the 64 who we e 
Jst serious criticism is that the TE 
architects were successful and distinguished men when the study ies ial 
and they knew it. Anyone who acquires a reputation as a distingus sle 
thinker develops independence and confidence in his own opinions. ree 
listen to him. He will probably 


ann, ame [tle 
7 feel secure enough to mention som 
weaknesses; m 


any great men have permitted themselves this kind of ge 
down egotism. Tolstoi, in his old age, said that all he tried to do in v s tie 
Peace was to amuse his readers. Those who have lived uncreative lives Si 
little to offer except reliability and sincerity. This criticism does not questi 
the obtained differences, only the time and mode of their acquisition. nd 
acteristic of creative writers, according to pi 
arron, 1965), is, to no one’s surprise, their sae A 
fested in very high scores on the Terman saps 
est high-level achievement of abstract verba E 
Psychologists of concern for philosophical a ho- 
and skill in self-expression. The creative writers made high scores for psy The 
personality inventories, as compared with ordinary writers. 


; $ r entiona 
psychological interviewers noted their unusual ideas and unconvent 
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thought processes, but did not rate them at all pathological. On the con- 
trary, the creative writers impressed the psychologists as effective, productive 
persons, and they made high scores for ego strength on personality inventories. 
thoughtful characters who got things done, things that 
In other respects, such as independence, high 
were similar to other creative 


They were unusual, 
they themselves wanted to do. 
aspiration level, and nonconformity, they 


thinkers. 
Another study of this type is more limited but has certain methodological 
955) gave a personality test to three 


advantages. Cattell and Drevdahl (1 
categories of successful men: successful administrators, successful teachers 
CGincluding textbook writers), and creative scientists, 300 in all. As compared 


with the other groups, the eminent scientists were more schizothyme, self- 
sufficient, emotionally unstable (but no more than the norm for the general 
population), bohemian, radical, dominant, and paranoid. The self-sufficiency 
and avoidance of social contact that is characteristic of distinguished scientists 
appears in boyhood and thus cannot be a result of prestige acquired later in 
life (Terman, 1954). Although the Cattell and Drevdahl study included 
biologists, physicists, and psychologists, the results agree in many respects 
with the MacKinnon study of architects. ; 

A companion study (Drevdahl and Cattell, 1958) compared eminent artists 
and writers with the general norm group. The creative ones were more in- 
telligent, emotionally mature, dominant, adventurous, emotionally sensitive, 
bohemian, radical, self-sufficient, and energetic. These data refer to means of 
the creative group which probably differed from the general norms in educa- 
tion, income, and other respects; but still the results agree roughly with the 
results of the frst study, which included better control groups. l 

Both of these studies were done by mail, and thus they raise the question 
whether the returns were representative- Only half the artists and writers 
returned their tests, and one might suspect that those who agreed to expose 
themselves to the psychologist’s scrutiny would be dominant, adventurous, 


and self- i 

senner g accel psychologists (Clark, 1957) m directed at back- 
ground factors more than personality and motivation, ror confirm other 
E on eatly interests and productive contributions. 2 the ‘canny con- 
ttibutors to<Gentife psychology who received doadora ; = ae 1930 
and 1944, more than half had undergraduate majors te acne eee E the 
pared with only a third of the control group. And a ep ig artes : e 
slagiiReast eoneribitgrs, bit hardly any of the control group, had published 


at least one paper before age thirty. s 
Unfortun the research on artists $ A 
ately t ] weaknesses typical of studies of substan- 


tists, in spite of the methodologic 
f=} : è x 
ial elec phenomena in the real world, reveals a consistent picture. A 
good summary has been written by Taylor and Barron (1963), after review- 
ing several years of research, in the form of a list of traits found in study 
i i ntists. 
after study as characteristic of productive scie 1 


is scanty, but the research on scien- 
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1. A high degree of autonomy, self-sufficiency, self-direction. ee 
2. A preference for mental manipulations involving things rat! tha peor s 
a somewhat distant or detached attitude in interpersonal relations, 


i ai ially chal- 
preference for intellectually challenging situations rather than socially 
lenging ones. 


3. High ego strength and emotional stability. 
4. A liking for method, precision, exactness. . lin Bh 
5. A preference for such defense mechanisms as repression and isola 
dealing with affect and instinctual energies. i 
"O m PEY f onally toned con 
6. A high degree of personal dominance but a dislike of personally 
troversy. 1: 
£ g , ver-control : 
7. A high degree of control of impulse, amounting almost to over-co 
relatively little talkativeness, gregariousness, impulsiveness. 
8 


sy: Lae A B itive am- 
. A liking for abstract thinking, with considerable tolerance of cognitiv 


biguity. 
9. Marked independence of 
formity in thinking. 
10. Superior general intelligence. 
- An early, very broad interest in intellectual activities. 
12. A drive toward comprchensiveness and elegance in explanation. : If 
cays ; 7 TA itting onese 
13. A special interest in the kind of “wagering” which involves pitting he de 
against uncertain circumstances in which one’s own effort can be the 


ciding factor [pp. 385-386]. 


; dai s jard con- 
judgment, rejection of group pressures tow 


A more recent summary, including some additional data, by Cattell and 
Butcher (1968) agrees essentially with the above. These investigators, @ the 
comparing personality profiles of those who had achieved eminence in to 
sciences and in the arts and noting the rather small differences, went 0n 
comment: “It would almost seem as if the differences between science, att, 


and literature are differences of particular skills and interests only, and that 
the fundamental characteristic of the 
personality [p. 294].” ‘tues 
Lest the traits of creative thinkers seem like a universal catalog of aie 
it should be noted that creative thinkers are not, as far as we know, any eo 
agreeable, friendly, considerate, or tolerant than other people. They oe 
seldom described as happy and well-rounded, nor are they accused, any yee 
than others, of excessive modesty. Their errors of judgment in matters : m 
side their own specialties, though no more common than the errors of ordinar) 
citizens, are pointed out with glee : 
It is unfortunate that so much of the research has been limited to scier his 
The intellectual preference for things rather than people illustrates ai 
limitation because it is the socially challenging situations that have had 4 


è nes inno- 
analogous appeal for novelists and dramatists, and for another class of i 


vators: political, economic, and religious reformers. 
Nevertheless, this list of t 


typical adjustment of the cr 
exceptions to this generaliz 
to convert these generaliz: 
people of high ego strengt 
on the signs of creativity 


. ae 2. aw pe D 
creative, original person is a tvp 


è rars, S 
by these ordinary citizer „tists: 


raits offers a reasonably consistent profile oF o 
eative individual to his society. There are n 
ed description, of course, It would not be ee 
ations and infer, for example, that self-suffici ai 
h are unusually creative or to assume that practi 
will improve the substance. 
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Creativity Tests 

A very different approach to individual differences in creativity begins in 
the testing laboratory. Many tests of abilities relevant to productive thinking 
have been devised, but the most sophisticated and sustained research has 


been carried out by J. P. Guilford and his colleagues with tests of the open- 


end or completion type. Since the tests were designed for the production of 


uncommon, remote, and clever responses, many of them must be scored 
by expert judges, but this can be done with satisfactory reliability after 
brief training. The Guilford group has been concerned with the construction 
of tests and the analysis of their interrelations; the task of correlating the tests 


with other variables has been taken up by others. 


TESTS AND FACTORS 
A factor analysis of 46 tests, including many of the open-end, divergent 
thinking type, by Wilson, Guilford, and Christensen (1953), extracted 16 
factors, one of which was identified as an originality factor. Some of the best 
tests of this factor will be briefly described because they have had a dominant 
influence on subsequent research and have often been called tests of creativity. 
Plot Titles presented the plot of a story and asked the subject to write titles 
for it. The rating of these titles for cleverness was the best test of originality, 
. d 
in terms of factor loadings. : 
Quick Responses was a word association test scored for uncommonness. 
ie aa ie , P ores to common responses and high 
Objectiy ely derived weights gave low sc p! & 
Scores to uncommon responses: : 
Figure Concepts consisted of 20 simple drawings of eae = individuals. 
The subject's task was to find qualities or features suggested by 2 or more draw- 
ings; these also were scored for uncommo 
Unusual Uses listed familiar objects, su 


nness. 
ich as a newspaper, with instructions 


al ones. The more unusual uses 


to write uses for them other than the usu 
teceived highe 
er scores. 
g $ 
Remote Associations presented two words separated by a blank space, such 
o write on the line between 


as Indian _______ money, with instructions t ked 
them a word that associates the two: The words were ae sacl that the 
Y socia y 
right answer, which might be per") was a remote a s e of bo 
Remote Consequences required the subject : par t 5 =n aarp of 
Certain unexpected events, such as the sudden abolition of all national and 
local laws. The number of remote consequences a counted for one score 
' i nother. 
and the number of immediate consequences for a eh a 
Several of these tests were scored for sheer number . wor e podna a 
that ara be differentiated from the vall known factor 
Ess mc san yi factor could 
mple word fluency. mas 
ae recent 7 analysis of the contribution of productive thinking 
y feld, Guilford, Christensen, and Frick, 


abilities to problem solving (Merri 
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1962) identified four divergent production factors and described them in a 
revised terminology: ideational fluency, the production of many semantic 
units or ideas appropriate in meaning in response to a given idea; spontaneous 
flexibility, the production of many categories of ideas or semantic classes; as- 
sociational fluency, the production of many related ideas; and originality, the 
production of a variety of changes of interpretation, neither immediate or 
obvious, that are appropriate to a general requirement. Approximately the 
same factors have been identified at the junior-high-school level (Guilford, 
Merrifield, and Cox, 1961), and, with specially adapted tests, at the sixth- 
grade level (Merrifield, Guilford, and Gershon, 1963). A further systematic 
development of this program (Hendricks, Guilford, and Hoepfner, 1969) is 
the construction of many tests of creative social intelligence and the analysis 
of these into six divergent-thinking j 
havioral ideas in quantity and variety 

Another test of this type, developed by Mednick (1962) and called the 
Remote Associates Test, has also been used in recent research. Three words 
were given, such as surprise, line, birthday, and the subject was asked to find 
a fourth word associated with these three, This is very similar to Guilford’s 
test of remote associations except for the theory behind its construction. 
Whereas most people have a Strong association between a stimulus word and 
the common stereotyped response word associated with it and only weak 
associations with the more uncommon response words, Mednick assumed 
that the more creative people have flatter associative distributions, that is, 
less Strong associations with the common response words, and stronger as- 
sociations with uncommon response words. This flat distribution of associates 
ep ar more creative individuals to produce the relatively 

ssary for success on these test items. 

The Torrance Tests of Creative Thin king (Torrance, 1965) employ similar 
oe to get measures of fluency, flexibility, and originality with verbal 
conc pte cn ney a Si os d 

teens Cate ‘i isi ted by the names of a few: Asking ee 
Tees, ne © sig onsequences, Product Improvement, Unus iia 
research in hated and Figure Completion. A large share as 

Use oF the ral et has made use of these tests. sdti 
misunderstanding, They a or tests of this nature has ied to gos it 
home et i aad Soe samples of work that is obviously were 
achievement, Ne ae = : taken as a self-evident criterion of Metn 
creative achievement dre we en be taken, without proof, as pre ae 
question. The tests Ka bens y for this purpose is a troublesome eap well 
TebSSenbed in Pa ae ie the study of Productive abilities Oe iid 
and mislabeling, filo Intelligence tests, and, despite A E 
ation. Individus] disen on tasks such as these deserves serious “salve 
in some detail; the oe, E i on these tests have been ey de- 
pendent variables, nats a os used as predictor variables an « eis 
true that some of che in pre effects of experimental manipulations: aa 

rather trivial, especially when the dimens 


abilities involving the generation of be- 
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of breadth is recalled, but the hope here, as in other areas of psychology, is 
that general principles will be found which will be applicable to more sub- 


stantial problems. 


CORRELATES OF CREATIVITY TEST SCORES 

Aside from definitions, labels, and good intentions, the meaning of scores 
on creativity tests will be progressively clarified as their relations with other 
Scores are examined. The data at hand include comparisons with conventional 
measures of intelligence and achievement, personality, and specially con- 


structed laboratory tasks. 


INTELLIGENCE n , , 
If creativity tests are to be useful, they must be different from intelligence 


tests, It is not just that intelligence tests have a well-worn place in the Psy- 
chologist's bag of tricks. Intelligence test scores are fairly stable, adminis- 
tration is standardized, and norms are available. Their uses and limitations are 
fairly well understood, especially in educational situations. When creativity 
tests are used as intelligence tests are used, do they supply additional in- 
formation? 

The correlations between scores on tests of intelligence and tests of 
creativity vary considerably, depending on the specific tests used, the methods 
of testing, and the samples tested. The range of ability in the sample tested 
is most important because the correlation between any two abilities is reduced 


when the variation of either is restricted, as in special schools for superior 
amples. Low correlations have been reported for 
a i 


-average children (Cicirelli, 1965; Flescher, 1963; 
bs d 


Getzels and Jackson, 1962), but correlations of between 55 and .77 have 
been obtained in a broader sample of 175 Scottish children with mean IQ of 


102 CHasan and Butcher, 1966). Correlations from —.40 to above .60, with 
a median of 20, have been obtained with the Torrance Tests (Torrance, 


1967), 


children and in college s 
Testricted samples of above 


In samples of college students, Mednick and Andrews (1967) obtained 
correlations in the .30's and .40's between various intelligence tests and the 
Remote Associat Test and the correlation of .48 between the same test and 

ssociates 7 Laughlin (1967) is similar. 


rman Concept Mastery Test re orted by 
In tl N b; P p, 
‘or? ger correlations wou d 
general popu ation of co lege age. somewhat larg 


bee i à 
Xpected, r d in discussions of creative thinking that in- 
Although it is commonly asserte > 
6 a here seems to be no empirical sup- 


x i t 
telli is i only up to a point, : Beart 
yore eel gaat ay P mias that the relation between intelligence 

òo 


o í â 
a T oe hing a tests is anything but linear. Scores for creativity, 
erformance on creativity i cs se: 
Od rae oe achievement obtained from 609 pupils in a suburban sixth 

i $ u a su 1 
Stade (Cici 1, 1965) indicated mostly linear relations, with little evidence 
et ects. Since Mednick and Andrews had a 


or interacti threshold eff : ‘ 
large PA Lait college freshmen, they were able to examine this ques- 
» fo} 


: Pa > a vels in respect to scores on the Scholas- 
tion by dividing their sample into 5 le P as 
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tic Aptitude Test. They found that the correlations with A ee 
Associates Test were about the same at all levels of intelligence. Low a 
relations are easily confused with nonlinear correlations, both conceptually 
irically. ae 
eo ee analyses have been directed, not at separating a 
ability from general intelligence, but at analyzing both of these into po 
factors. Nevertheless, it would be logical to assume that tests of divergen 
thinking would be grouped on one factor and tests of convergent ing 
which constitute a large part of general intelligence, on another factor. ; 
check this assumption, Cropley (1966) gave tests of both types ioan z 
selected sample of children, and, though the analysis identifed the diviga 
and convergent factors, the correlation between them was .51. As Crop d 
concluded, divergent and convergent abilities can be factorially differentiated, 
but they are not independent. Sa 
In addition to the questions which these findings pose for factor analysis È 

a method, they suggest that some of the variance attributed to creativity scores 


x z P 3 P S GLY: 
could just as well be attributed to intelligence. Tests of creativity are relatively 


jai : -end 
new. The objective tests may not be pure, and the scoring of the open 


tests requires time and training. Hence correlations between tests of age 
are often as low as the correlations with intelligence (Barron, 1955, EN 
Cline, Richards, and Abe, 1962; Cline, Richards, and Needham, DeD 
Flesher, 1963; Getzels and Jackson, 1962). On the other hand, a study we 
scientists in several South African research institutes found moderately high 
intercorrelations among creativity tests and quite low correlations with a 


5 veces : : əneral state- 
conventional intelligence test (Shapiro, 1968). Evidently, a general a 
ment of the relation between 


4 4 ons ae is not 
intelligence and creativity test scores ! 
possible at this time. Improv 


k ras F ‘ sJations: 
ements in reliability will raise correl 
purification by elimination of noncreativity variance will lower them. 


Wallach and Kogan (1965) tried to take account of all these methodological 
considerations when designing new creativity tests for children. They Fell 
their tests individually, in a gamelike situation with plenty of time. a 
me all the ways in which a potato and a carrot are alike.” The tests were TA 
for unusualness, with good reliability. For a sample of 151 fifth-grade chile ag 
from professional and managerial families, the average intercorrelation amori 
the creativity measures was .40, among the intelligence measures, 5, ae 
between a composite score—derived from an intelligence test and one 
tests of educational achievement—and creativity only .09. Wallach and mone 
(1969) got similar results with 500 high-school graduates. As a measure na 
intelligence, they used scores for verbal and mathematical achievement o 
the Scholastic Aptitude Test. These scores were practically independent 


: ó 5 ah verba 
scores for fluency of ideas and uniqueness of ideas in working with 
and figural materials. 


EDUCATIONAL ACHIEVEMENT 


aes sen clo 
Historically, conceptually, and empirically, intelligence has been € pe 
related to educational achievement, and the recent emphasis in some qua 


A i x mAs z jnter 
on creative thinking is intended to broaden both. The argument is that 


sely 
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ligence tests should be supplemented by creativity tests, and that the conven- 
tional learning of right answers should be supplemented by learning to 
produce original ideas. If all goes well, school grades will someday include a 
just measure of creative accomplishment and therefore will be predictable by 
creativity test scores as well as by intelligence test scores. At present, creativity 
tests cannot be expected to predict scores on standard achievement tests 
unless such achievement is somehow facilitated by productive thinking or the 
creativity scores are impure measures containing some noncreative convergent 
abilities. Both explanations are tenable, since the data are sparse. 

| with average IQ of 132, Getzels and Jackson 
(1962) selected a group of students with high scores on creativity tests but 
ligence (mean IQ of 127) for contrast with a 
lligence (mean IQ of 150) but low scores 
nts of those called high-creatives, as 


In a special secondary schoo 


relatively low scores on intel 
group with very high scores on inte 
on creativity. The educational achiev 
well as those with high IQ, were definitely superior to the school average. 
The teachers enjoyed working with the high-IQ children but not the high- 
creatives, Similar results were reported for some other schools by Torrance 


(1962, 1967), and Cline, Richards, and Abe (1962) also reported rather high 
ding correlations reported by other 


correlations with grades. But the corresponding 
investigators (Cicirelli, 1965; Edwards and Tyler, 1965; Flescher, 1963; Wal- 
lach and Kogan, 1965; Wallach and Wing, 1969) have been low. There are 


also contradictions between the published reports which may clear up when 
Separate school subjects are studied. Cline, Richards, and Needham (1963), 
for example, treated science grades separately and raon ~ cs pel the 

i : wi ontributed a li > 
highest correlation, but that the creativity tests also c k oe e to the 
multiple correlation. In this school, incidentally, scence teachers did not dis- 
like those who got high scores on creativity tests. it ‘ 

Since teachers have been accused of disliking creative chii ren, : e results 
obtained by Hasan and Butcher (1966) are particularly or gine AK Their 
Scotti x p ; «ty were liked by the teachers 
Scottish children of high IQ and low creativity 3 


more than the children of low IQ and high creativity, as Getzels and Jackson 
those high on both measures. The 


had r : are 
lad reported, but the best-liked group were s 
simple explanation of all the data, according tO Hasan and Butcher, is that 
teachers dislike children of low 1Q. 


Jeme: 


YLE i 
d on the basis of performance on 


they rate in other respects that may 
“and interests. A study of 100 
cially informative because the 
for creativity, primarily based on Guilford’s 

was statistically eliminated 


J : ë 
rbal intelligence a 
as thus identified, were characterized 


ed as follows: 


PERSONALITY AND COGNITIVE sT 
When individuals have been score 
Creativity tests, it is instructive to See how 
disclose their intellectual habits, motivation, 
Air Force captains by Barron (1957) is espe 
men were given a composite score 
sons then the component due to ve 
Y partial correlation. The high-creatives, s 
Y several groups of traits, which may be summariz 


ispositi : sy imuli 
Disposition toward integration of diverse stimu 


Energy, fluent output, involvement 
Personal dominance and self-assertion 
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Responsiveness to impulse and emotion 
General effectiveness of performance 
Expressed femininity of interests 


Since the correlation between creativity scores and intelligence scores rs 
Wallach and Kogan (1965) research was low, they were able to oe 
151 fifth-grade children into 4 groups: high on both; high on sag on 
and relatively low on creativity; high on creativity and relatively an 2 
intelligence; ‘and relatively low on both. Without regard to any ea 
achievements, they found rich variations in modes of thinking and adju “4 
ment to the school situation. The results for girls and boys were — e 
many respects and had to be analyzed separately; a few selections iron E 
abundance of detailed comparisons will illustrate the trends of the sine 

The girls high on both variables were self-confident, sociable, and intere K 
in academic work. In a boring classroom they were eager to propose nove’, 


. payee ae Sele y were 
divergent possibilities; hence, from the teacher's side of the desk, they 
noticeable for their disruptive 


Girls high on creativit 
the classroom; not confi 
ship of others, nor did o 


» attention-seeking behavior. of 
y but low on intelligence were at a disadvantage : 
dent in themselves, they did not seek the companio! 
thers seek them. They too were often disruptive, not s€ 
much from enthusiasm as from protest against their plight. ‘dion 

The low-low group seemed to get along better. They were more con a 
and outgoing in social relations, perhaps in compensation for poor acade 
achievement. 


The girls high in intelligence but low 


in creativity were oriented to 
academic achievement 


» not distracted by new ideas, and quite suited we 
usual school situation, They were cool and cautious in social behavior, bt 
were liked by others anyway. 

The same analysis for the boys y 


ielded less clear-cut differences. 
e cognitive styles for cat 


egorizing strategies of these children were = 
closed by Sorting tests. Pictures of 50 objects—such as a rake, a screwdris i 
a telephone, a lamppost, and a candle on 
put together w! 


was between th 
similariti 


—were displayed with instructi here 
nat seemed to belong together, The important contrast a 
ematic reasons for grouping and reasons based on aan 4 
es among the objects. Wallach and Kogan give the example i bel 
comb, a watch, a pocketbook, and a door grouped thematically under the rts 

“getting ready to go out.” Conceptual groupings were based on abstrac = 
similarities, with labels such as “hard objects” and “objects used for ae 
It has usually been assumed, as noted in Chapter 2, that sorting on a acon 
basis represents an intellectually inferior level of development, and the a 
of high intelligence and low Creativity avoided this kind of grouping in n 
of conceptual grouping. The highly creative boys, however, made use of in 
thematic and conceptual grouping; they seemed to be able to switch Her a 
from one to the other, while those of high intelligence and low creativity 


i auch 
seemed to be locked in conceptual categorizing. These contrasts were " 
less clear for the girls. 


y 
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These studies of psychometric creativity, which started out to differentiate 
an exciting new variable from general intelligence, have not yet progressed 
very far. In the Wallach and Kogan study, as in most others, the correlations 
of other variables with creativity scores were low in comparison to the cor- 
relations with intelligence. In the words of one reviewer (Shulman, 1966): 
“Whether due to lower reliability or less relevance, the newly defined creativity 
variable is far less productive than the old workhorse, intelligence [p. 307].” 


SPECIFIC LABORATORY TASKS , 
Perhaps more research by the same methods will clear up the present rather 


diffuse picture of the correlates of creativity test performance, but the sharper 
methods of controlled experimentation may also be necessary. For example, 


the information yield from laboratory experiments can often be augmented if 
c vo or three levels according to their scores 


the subjects are grouped first into tv 
ffect is significant, the results give more 


on an ability test. When the levels e 


Specific meaning to the test scores. 
The Remote Associates Test correlates with measures of verbal productivity 


(Mednick, Mednick, and Jung; 1964), as one would expect; but Mendelsohn 
and Griswold (1964) went further and used scores on this test to divide col- 
lege students into three levels of creativity for comparison with their perform- 
ance in solving anagrams, after focal and peripheral exposure to the solution 
words. A list of words that included solutions to some of the anagrams was 
read and memorized in advance while a tape recorder was playing another 


list. The subjects of high-creativity level solved more anagrams with solution 
> as well as focally), as compared with 


words previously exposed (peripherally he TE 
another list not previously exposed, than those of lower-creativity level. 4 he 
3 h scorers deploy their attention more 


best interpretation seems to be that the high | ; 
widely and thus receive a broader range of information; they screen out less of 


the incidental material. This breadth of the high scorers in attention to, and 
recall of, seen and heard words corresponds with their production of remote 
3 a 


associates o ae 

ates on the creativity test. ius : : 

i j; a i 67) in an $ 

a similar design was employed by Laughlin ea rga i ss teas like 

those descri P 9, the formation oF wor classes from relevant 
describe: hapter 2, 0n $ : 

heal 3 hap s given lists of 6 words and asked 


and irrelevz : llege students were 
evant words. College s$ ; " 

to “figure out the concept or way in which four of the six words go together. 
d a 


Later, they were asked to recall the other two words or > — wi ex- 
emplified. For example, given the list: globe, | would by m r , eta raut, 
and baseball, learning the 4 that are round per 5 l Ten it ts ae 
and learning the 2 that are sour would be incidentia ae sa order 
to discover the 4 relevant words, the subject had a ig et FT aia 
words, but he could then forget them. However, when the subjects were 


es ó > A s on the Remote Associates Te: 
divided into 3 levels according to their score: st, 


those who scored high did better on the incidental task, both in recalling the 
5 than those who scored low. A parallel 


Words ; s h i 

‘ and in stating the concept, à 

division by levels of verbal intelligence did n9t separate performance on the 
all familiar, so vocabulary size 


incidental task so sharply- The words were 
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was not important. What was important was the ability to si, PENT ge 
utilize remote associations—the ability that is involved in the cet ; 
sociates Test. Overall, performance on incidental concept formation corre me 
.35 with scores on this test and .20 with scores on the Terman se 
Mastery Test. But, as in most of the correlation studies mentioned car me 
the correlation between the 2 tests was larger, in this case .48, than the cor 
relation with other performances. f aadi mi 

Corresponding results came from an experiment (Riegel, Riegel, a X 
Levine, 1966) which divided people up according to composite scores noms 
creative personality test and the Remote Associates Test, Those who 2 
high produced more different responses on logical tasks, such as naming 
superordinate and subordinate classes, but fewer different responses on we 
fralogical associations and simple grammatical tasks, than those who ea 
low. In producing words from semantic classes, the more creative had hetter 
control of their production or better adaptation to the task, in that oe 
separated their restricted associations from their free associations more clear r 

These recent studies of the intellectual processes related to creativity test 
performance are limited in scope and the obtained differences are small. They 
point, however, to a common ability—in 
solution, and in the formation and utiliz 
tentional use of words and 
to adapt them to the re 
coincide with the 
Test was dev 
Kaplan, 


a verbal creativity test, in anagram 
ation of word classes—to make in- 
associations originally acquired incidentally P 
quirements of the present task. This ability does as 
theoretical assumptions on which the Remote ani gael 
eloped, but recent research (Jacobson, et al., 1968; Olczak i 
1969) has failed to support these assumptions. This test seems 
measure convergent production more than divergent (Guilford, 1967). 


Predicting Creative Achievement 


~ s of 
Predictions of beh the 


that 


, . . ally prediction 

avior are notoriously inaccurate, especially predict t 

socially signif. . . i ive achievement, 
socially Significant behavior, and in the case of creative achievem¢e 

criteria to be predicted appear to be complex. We have noted, eae past 

one trait, high general intelligence, characterizes creative thinkers © ra 
r . i A . 3 arec! 

and present, hence it should be enlightening to follow the adult een by 

highly intelligent children. Such a long-term study was started in oa 

s ji: ae As yho He 

erman (Terman and Oden, 1958) with about 1,500 boys and girls “t ildren 

IQs of 140 or higher; the average IQ at ten was 152. Now that these chi 


i onts 
a : - ae achievemen 
have grown Up, records of books, poems, patents, and similar achiev we 
show that their creative oy aha 


oy itput has been far above average. It is true, 
critics have noted, 


: ‘lies an 

: that they came mostlv from middle-class familic chil- 

started life with middle-class advantages. But it is also true that these lli- 
f ea 

dren were picked out of the i 


vast middle class by their scores on the m 
elect group, most individuals had little Ta hile 
> many intelligent people lead happy, wort F 

achievements that would not be called crea 


gence test. Even in this s 
accomplishment to boast of 
lives and attain success by 


tive 


ive 
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Although there is abundant evidence for a relation between intelligence 
and creativity, intelligence is not so useful a predictor as one might expect. 
Since a college education, and often additional graduate or professional train- 
ing, is a prerequisite for entrance into most of the occupations that have 
been examined, the prediction question arises only for samples of individuals 
enough to complete considerable formal education. Even so, 
vomen mathematicians scored significantly higher 
astery Test than representative women mathema- 
1965). In most studies, however, when 
the predictive value of scores 


with intelligence 
a small sample of creative v 
on the Terman Concept M 
ticians, all well educated (Barron, 
advanced education is the first selection device, 
ge. 

ducation is one predictor of creative 
and it is effective, not only because 
e of the special interest in the 


on a test of general intelligence is not lar. 
Completion of the necessary formal e 
achievement, especially in the sciences; 
of the contribution of intelligence but becaus l 
materials, Few people go to art school unless they enjoy art. No one can 
endure graduate training in chemistry or psychology unless he enjoys at 
least a portion of it. A psychological study of the successful dropouts, those 
who made significant creative achievements without the customary formal 
education, would be particularly informative, but the small number of such 
cases prevents it. For the same reason, a study of those individuals who have 
made creative contributions to new fields for which formal educational prep- 
aration has not been widely available, such as directing films, would be in- 
formative. In the common fields of work, completion of formal education 
means that the whole sample is characterized by adequate intelligence, edu- 
cation, and interest in the field. Hence three potentially good predictors have 
already been used as selection devices. The problem now is to improve 
prediction among individuals who are already well screened. ii 
If we are willing to take work samples as a criterion, some vali lity a s 
ficients reported by Jacobsen and Asher (1963) seem quite promising. < he 
“sted of four 15-minute problems, such as improving office 

foreigners to visit the United States. The authors neo- 
Iving emphasizes cognitive flexibility, cn their 
tests include perceptual materials (such as a series at as ‘a = ha oaf 
changes to a dog) and a good score represents ae to shift ion eo 
Perception to the other. The scores on these ye r ea y bigh 
Correlations with performances 0n the work samples, although the predictors 


bal. 


work samples cons sted 
desks and encouraging 
field theory of problem so 


were perceptual and the criteria ver . . . 
5 J i common V r Ji 
Since tests of creativity seem to have something in common vith creative 


achievement, many attempts have been made to predict pe achievement 
from performance on creativity tests- The results have not been impressive 
except in a few small samples (Taylor and Holland, os oe en- 
couraging results with creativity tests come = ae i y es allach 
and Wing (1969) conducted by mail. About 500 hig h-school. gra uates, who 
had been admitted to college and were willing to participate in the research, 
reported their high-school extracurricular aeomp eae ny several cate- 
gories: leadership, art, social service, writing, ramatic arts, music, and 
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i e, At the same time, they took tests of productivity of ideas and oi" 
av ae The creativity tests differentiated quite well between ee Wa 
kinh pies low nonacademic achievement in art, a par prec A 
ingly leadership. The test of productivity rather than the m ne of iai 
separated those of high achievement outside the classroom se ne pore 
achievement. Thus the authors’ interpretation emphasized ideationé 
or “cognitive vitality” rather than novelty as such. PN T 

Some success in prediction has been achieved also th exple ator) 
with printed personality tests (Cattell and Butcher, 1968 ee 

The most consistently successful predictors are biographical pills P Tee 
because interests, work habits, and other requirements for creative ac ; 
ment develop carly and remain fairly stable 
(1968) demonstrated this st 
for identifying creative 


over time. Schaefer and Anas 7 
ability by constructing a biographical aa A 
high-school boys, using as a criterion eacha ipa 
ations supported by actual creative products. With large samples, a j e 
validation, they were able to attain validity coeficients of .64 for 4 

achievement and .35 for s 


jaence was nobdirectly 
cientific achievements. Intelligence was not 
controlled, but criterion 


ades 
and control groups were matched on seköal os 
and socioeconomic levels. A companion study (Anastasi and nano 5 
of high school girls vielded coefficients of 34 for artistic achievement ¢ 
for literary achievement. 
Looking back at the dis 
the creative students came 


E hat 
riminating items of the inventory, it is evident tha. 
from families of intellectual interests and tenant 
oriented more toward reading than sports. Even as children they so up late 
absorbed in their hobbies, to the extent of skipping meals and staying UI 
in pursuit of their own intellectu 
controls, 


‘ re than the 
al goals. The creative students, more t 


nvironmental diversity, having aera m 
al experiences. Like the geniuses of pie 
more unconventionality, less social tite 
an the other students. In general, the no wa 
correlates of creativity are quite similar for boys and girls, for po their 
scientists, artists, and. writers. But the creative boys were closer tO 


heir 
a i : irls were closer to t 
mothers than to their fathers, while the creative girls were close 

fathers. 


The life styles 
lege, Richards, 
samples, that ex 


had been exposed to e 
often and enjoved more unusu 
creative students showed 
and more daydreaming th 


ore 
the 
ion, 


sop injo eol 
A ET srsist into 
developed in grade school and high school persi 


sth large 
Holland, and Lutz (1967) have demonstrated, with #è 


: from 
: : ; : dicted 
tracurricular achievements in college can be pre tive scien- 
z a ‘ i a . cure: a * 
extracurricular achievements in high school. Their measures of crea dent 
tific achievement c 


onsisted of self-reports, such as: did an or 
scientific experiment, won a prize or award for scientific work, ha college 
published in a Scientific journal. The correlations from high school ‘4d 24 for 
were .31 and 22 for male and female college freshmen and 40 an relations 
male and female college sophomores. Corresponding predictive et 50’ 
for artistic achievements and writing achievements were in the .40's an sement: 
These correlations were quite specific to the different fields of ae radarit 
indicating an early channeling of productive interests and efforts. 
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grades did not correlate well with these extracurricular college achievements. 
However, a printed test, called the Preconscious Activity Scale, made up of 
items such as “I often daydream about unsolved problems,” was related to 
achievement in art, writing, speech, and drama in high school, but not in 
science (Holland and Baird, 1968). 

The next step, prediction of achievement in adult jobs, is the big one. 
Certainly there is some stability of interests and abilities, but the predictive 
may be lost if the sample includes only college graduates or 
graduates from special training programs. Nevertheless, predictions of scien- 
tific achievement are approaching the accuracy necessary for practical ap- 
anizations where the competition for scientists 
eir output justifies the statistical work necessary 
to develop a specific inventory. Smith, Albright, Glennon, and Owens (1961), 
for example, developed an inventory of personal history items for the identi- 
fication of petroleum scientists, using patents and ratings as criteria, and suc- 
ients of concurrent validity, with large samples, above 
selected personnel. The most creative in this 
had what the authors called an “academic 
colleagues in intellectual 


value of these 


plication in large research org 
and the commercial value of th 


ceeded in getting coeflic 
50, even among these highly 
large oil company laboratory 
orientation,” in that thev resembled their university 
interests and working habits. 
Similar biographical inventories h 
pharmaceutical scientists, yielding validity te idatio 
in the .30’s (Buel, 1965; Tucker, Cline, and Schmitt, 1967). The discrimin- 
sonality profile not unlike that of the 
in this chapter. 
have been developed for specific practi- 
cal purposes in the selection of high-level personnel, some generality has been 
reported. The keys constructed for petroleum scientists have reasonably good 
validity for pharmaceutical scientists also Buel, Albright, and Glennon, 
1966). The profile of the industrial research scientist 1n British laboratories, 
outlined by biographical data, resembles that of their Arean a 
in gencral, though there are some differences in the particulars haney, 


ave been constructed for large samples of 
coefficients, after cross-validation, 


ating inventory items sketch a per: 
Creative people who appeared earlier 
Although biographical inventories 


1966). , ; ? 

After reviewing many of the attempts to predict creative achievement, 
š À i z 

Taylor and Holland (1964) wrote the following summary: 

by class rather than by specific scales or inventories, we 

pet's rue dering Tega, Hn Es gad and aper 

most efficient predictors. Self-ratings an Ieee SApS i 

tions are ome Originality and personality ee tee Capt bask reser 

Aptitude and intelligence mease = ee finally by parental attitudes 

Tange corrections are really applicable), followed finally by p 


[p. 41]. 


If we look at predictors 


Biographical data inventories do not explain creative achievement. By a 
Strict definition they do not even predict, since the measures reported are 
Measures of concurrent validity, of success in identifying scientists already 


; : itiv -image, may be k 
productive, and some traits, such as a positive self-image, \ a con. 
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ire ardento intellectual 
sequence as well as a predictor. But they do give Eua of ee, whieh 
; i r r early in life 
i zi abits, < ife styles acquired rather carly in life, wh 
orientations, working habits, and life T 1 se agen er E 
ingui > creative f TS similar d va 
distinguish the more creative from others of sin 


Favorable Environments for Creative Work 


itions is just 
i favorable worki conditions 15 J 
Every employer knows that arranging favorable working c cea aa 
as important as selecting capable individuals, but in the case o e ; irs poet 
i: š ee ae P W availa > evider 
less effort has been expended in this direction. The available evide 
mostly from surve 


iaie en 
vs of accomplishments and complaints of aiia 
contrasting the university setting with the large research a The 
1956; Pelz and Andrews, 1966; Taylor, Smith, and Ghiselin, l ie p 
optimal condition, in general terms, is one which the individual a ao 
working habits match the goals and working conditions of the n a 
that pays his salary. Scientists working in governmental ais a a the 
ratories typically complain of the pressure of the immediate gos “ads that 
organization, inability to pick their own problems and follow ric ea sublish 
turn up, and lack of recognition for their work. They would prefer pasa 
under their own names so that their colleagues and children will oe ac: 
their accomplishments. Some large laboratories are organized, ois a oe 
cident or design, to conflict with the working habits and demands 
dividualistic creative thinkers, 
In medical research also these 


<niiere 
‘ r Sri, ~ ore seve 
intellectual freedom and the goals of the organization appear m 


; and 
: cake ; sdical schools an! 
large hospitals and health organizations than they are in medical 

universities (Gordon, M 


) nes sature 15 
arquis, and Anderson, 1962). The criticul etic? 
the specificity and immediacy of an organization’s research goals in 
to the creative thinker's longterm, 
In the universities, the 


or 


i Tere B 
å investigator's desire 
conflicts between the investigator in 


inner-directed motivation. he lack of 
scientists’ complaints usually stress the dustrial 
facilities and technical support that they might enjoy in a large ir 
laboratory and the daily pressure of other organizational goals. yeraliza” 
Complaints are not ‘good data on which to base psychological a where 
tions; but scientists, like other people, tend to inhabit the environmen ductive 
they feel most comfortable. Thus scientists highly motivated for Fox 
research prefer the university setting, and more creative scientific 
does come out of university laboratories, by and large. jiscovered 
The survey of psychologists referred to earlier (Clark, 1957) dis people 
that the significant contributors care more about the opinions of top F joa 
in their field, while “psychologists in general” care more about the i were 
of local colleagues than their immediate supervisors. When psycholog “explicit 
asked about an ideal Setting, the significant contributors were quite 5 
in what they 


sere? fantasies» 
wanted. The paradise of creative psychologists Be 
sketched by Clark, follows: 


as 


kers 
ino iei a a ing co-won 
The setting is in a university or is at least a place with shinies where the 
where there is an Opportunity to consult experts in other fields an 
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researcher is free to choose his own kind of research problem; the researcher 
wants enough time, that is, he wants to be released from other activities and 
he wants to have enough money to do the sorts of things he chooses to do. 
Much more popular than a group setting is a setting where one has freedom to 
choose one’s own work and where he has independence [p. 89]. 


By contrast, dependable information about the conditions of artistic pro- 
S, composers, writers, and other creative artists do 
ons where their output can be rated by super- 
visors. Many flee when an inquisitive psychologist approaches. And un- 
fortunately, the reminiscences of artists and writers, after they have become 
| than psychological value. We do know, however, 
that there is a great variation in “ideal” settings. Painters and sculptors have 
often constructed their own environments, to meet their own requirements. 
Scholarly writing, linked to library facilities, has frequently come out of the 
universities, but imaginative writing, linked to the writer's personal dreams 
and frustrations, has been produced in stimulating cities, in pastoral retreats, 
and even in suburban homes after a hard day at the office. 

Genuine creative achievements are not usually produced on demand under 
controlled conditions, so empirical investigations of favorable and unfavorable 
environments have had to turn to creativity tests, or similar tasks—which for 
this purpose are dependent variables rather than predictor variables. Pieper 
(1965), for example, looked into the effects of the physical environment of 
office workers on creativity test performance. One setting was a semidark, 
monotonously furnished office, in which the subject had no stimulation but 
his memories; another was a typical everyday office. The third ran an EA 
stimulating setting, with pictures on the walls Cincluding o ee ar to 
but not identical with those of the test), pleasant furniture, i pleasant 
lighting; the subject was encouraged to walk around ee revs from 
the typical chair-bound posture and enjoy the delights n tha bee percent a 
he worked. The experiment, conducted in a mere actor mi i ae 
ment personnel as the subjects. found ifference between these 


a significant € 
i e e surr inos and tł 
situations, presumably because the richness of oh ec ue 
freedom of movment aided the imagination. Simi ar! y s ie s Š 
allowed to handle objects as simple as a paper clip 


an children who could only look at them 


duction is meager; painter: 
not gather in large organizati 


eminent, have more literary 


level children who were 
Produced more unusual uses th 
(Goodnow, 1969). 

The university setting, 
Special stresses, which stimu AE By 
(1968) studied the effects of these frustrations 9) 


gradu; aa Í business to estimate their frustration level immediately after 
graduate school of business i eiastically anticipating the best, and a few 
enthusiasicea™™ 


ken their first examinations. On the second 
nee given and, as expected, those students 
vere g 

frustration made the lowest scores. The 


nees test W 
ion of remote consequences, which 


its general advantages, arouses its own 
dividuals and frustrate others. Hinton 


asking students in a 


despite 
late some In 


entrance, when most were 
months later, after they had ta 
Occasion, three creativity tests Yi 
who reported the most increase in 


i s ~onseque! 
differential effects on the conseq" 
e the product 


vere especially convincing since 


frustration seemed to reduc 
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are the creative ones, but had no effect on the production of immediate 
uences. 
a ee environment that discourages creative thinking is another 
of the many things teachers are blamed for, and it appears that this environ- 
ment can be analyzed more readily than most others. Torrance (196+) has 
collected considerable evidence from questionnaires and checklists of elastin 
activities to show that the atmosphere in most classrooms is inimical to ~~ 
thinking and that many teachers can, although with some difficulty, oe 
toward more encouraging practices, This is an important factor, as we sha 
see, in the outcome of training programs for creative thinking. 
Such environmental variables deserve more serious attention than they 
have received so far; some at least can be experimentally manipulated. T he 
results of the experiments outlined above, if duplicated in other settings, 


would have important theoretical and practical consequences. 


Processes and Conditions of Productive Thinking 


This chapter began with the asse 
accomplishments are produced 
Now we must try to describe 
them 


rtion that the processes by which creative 
are matters for research rather than acs 
these processes and the conditions that influente 
, making some use of our knowledge of creative thinkers and their id 
vironments but depending chiefly on labor foe 
no one believes that solutions produced in the laboratory at the request oF ° ë 
experimenter deserve to be called creative, it is the achievements of ge 
thinkers and the fascination of hypothetical creative processes that ha 


F > ; . > subjects 
stimulated laboratory experiments. The principal ingredient that the sub) 
of the laboratory expe 


atorv manipulations. Al 


riments lack is, of course, the persistent peronai i 
volvement that appears so prominently in the biographies of creative thin j 
The two dimensions of creative achievement that laboratory measures ka 
spicuously lack are breadth of achievement and intellectual leadership. Wit ie 
these limitations, the aim of laboratory research is to derive some gene" 
principles that will apply broadly to the more intellectual aspects of at 
thinking. Admitting that the achievements of the Nobel prize winner are 


j es 
; ; erenc 
from those of the average taxpayer, we can still assume that the diff 
in thought processes are quantitative 


Simple verbal assignments illustr 
variety of specifications. Any 
given class when requested: 


creative 


rather than qualitative. ds to 4 
ate man’s ability to produce wor Pi a 
literate adult can casily produce words ie” 
“Write down the names of some automobi 


P the 
“List some vegetables.” The results give evidence for achievement O tion: 
: Ji ae P c 
class as well as ability to limit output to this class in response to instrue 

When more complex 


a 

è Sale : spec! 

$ Products are required, productive thinking is a a ait 

kind of problem solving, and all the Processes described in Chapter ts a 

occur before the thinker is satished with his efforts. But the attemP ihe 
: Bae MBAS ; ro 

analyzing productive thinking have concentrated on the production P 


i" 4 vary" 
presenting problems that encourage abundant production of solutions a? 
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ing the conditions of production. Alcohol in the bloodstream, for example, sets 
up a condition that has differential effects on production processes and thus 
offers a method of analysis. In one experiment (Hartocollis and Johnson, 
1956), the production of simple responses (Name words that begin with M) 
declined after alcohol, and so did convergent thinking on a vocabulary test of 
the completion type; but the production of words from a fairly large class of 
words (Name trees) increased slightly. Examination of the words produced, 
such as shoetree and family tree, suggested that the experimental subjects did 
not adhere to the specifications for the responses as strictly as the control 
subjects. This relaxation of standards encouraged moderately restricted re- 
sponses but not fluent production of unrestricted responses nor production of 
the one right answer. 

In order to study production processes per se, some experiments have 
climinated any judgment or evaluation of the solutions produced; others, 
however, have included the judgment process in order to describe the overall 
enterprise more completely. When the problems are not too complex, produc- 
tion and judgment can be separately identified and timed. An experiment with 

i hich preparation, production, and judgment 


three plot title problems in w i ; 
were separated illustrates this method of analysis (Johnson and Jennings, 
1963), The time spent reading the plot, projected on a screen, was considered 


preparation time. When the subject was ready, he wrote five titles to the plot, 
and the time spent on this operation was called production time. The time 


spent in selecting the best of these five titles as the final solution his con 
e of the data for college students are shown in 


solutions consumed the most time, as one 


might expect, but the important point is that those un pana — 
Supplying solutions to one problem did the same with another problem. 
And the same consistency is apparent for preparation and judgment. In 
ween times spent on the same process in different 
while the cross-correlations between different 


sidered judgment time. Som 
Table 7.1. The production of five 


general, correlations bet 
problems were fairly high, 
proc 


es were lower. . i sad 
These processes are differentiated by individual pa ce E on 
cach and by function in the whole enterprise, but they a Srdepengent: 


NDS) FOR THREE PROCESSES 


TABL q TIME (IN SECO 
E 7.1. MEAN TIME (1M son LE PROBLEMS, 


INVOLVED IN SOLVING PLO S 
eaithEN TIM 
AND CORRELATIONS BETWEEN TIN 


Mean Correlation 


Mean 

dg. 
Process Time Prep. Prod. Judg 
Preparation 48 69 85 
Production 198 40 pe a 
Judgment 45 .40 : : 


From Johnson and Jennings (1963). 
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Preparation does not just mean reading the problem; it consists oe 
the information and getting ready to produce solutions to speci RR 
this experiment, the reading time was longer for preparing 2 l A 
titles than preparing to judge titles that others had produced. The pr nat af 
process is the most obvious one: it appears to be the central compone oe 
productive thinking and thus has received the most experimental aloe? 
Judgment, although it has usually been treated in SS B 
as a separate topic related to psychophysics, consists, in these experiments, 
evaluating the products just produced. 


PREPARATION FOR PRODUCTIVE THINKING 
The results of experiments on the 
seeking summarized in pre 
there is a special question 
Some eminent thinkers ha 


preparation process and on m E 
vious chapters apply also to the present eae 
about optimal information for productive pions 
ve recommended a thorough study of the prob 

and of previous attempts to solve it 


i ave recom- 
; others, equally eminent, have re 
mended that we ignore past fai 


‘ 3 atl This old 
lures and maintain a fresh approach. TI he 
question was reexamined by Hyman (1961 ), but he 


av 
emphasized what may 
more important, the 


attitude—or set—toward information about a problem. 
Working in a large electrical manuf. 
to study attempts already made 
an automatic warehouse, One 
in order to make 


acturing plant, Hyman asked ed 
to design a system for recognizing EE 
group studied these previous attempts A 
up a list of faults; another group studied them constructive’) 

in order to make up a list of useful fe 


cubhiects were 
atures. Later, when all subject 
asked to propose t 


s m 
heir own solutions to this problem, and to another saat 
involving practical uses of the pyro-electric effect, those who had s$ 
constructively produced better solutions to both. sion. He 

A parallel study by Torrance (1964) reached a parallel ee age y 
asked psychology students to read two A 
critically or imaginatively, 
develop an original ide 
of the term, Here ag 
received Superior ra 
periments for te 


articles in psychological journa ad 18 
before the middle of the term. Then se day 
a, theory, or hypothesis and turn it in on the ie y 
ain, the products of those who had read DOE in ae 
tings for originality. The implications of these 

aching are obvious. 


QUALITY AND QUAN’ 
Much of the interest 
can be traced to the brai 
a book with the aus; 
on the basis of th 
enthusiastically, 


TITY OF SOLUTIONS 

among psychologists in the quality-quantit 1953) in 
nstorming procedure described by Osborn (partly 
Picious title Applied Imagination. He advised peop ° ` ideas 
© psychological research then available) to think is in a 
as many as possible, without criticism, and to do Lom an 
group so that each member could build on the ideas of others. Ce 
other suggestions to get thought moving were provided. Although ae ndising 
ing was promoted as a Practical technique for the solution of mercha educa 
and industrial Problems, it has Spread to other areas, as distant as 


y relation 
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tional and military problem solving, and has led to significant psychological 
research on productive thinking. 

The brainstorming procedure has two essential components, which may be 
ach worthy of serious investigation. The 
improves the production of ideas. The 
uctive than individual thinking, has 
ve cannot stray far into social psy- 


stated as psychological hypotheses, e 
first is that avoidance of criticism 
sece z inki i 

ond, that group thinking is more prod 


a long and controversial history. Since v 
chology here, we can only note that careful research (Dunnette, Campbell, 


and Jaasted, 1963; Taylor, Berry, and Block, 1958) has cast considerable 
doubt on the social side of the brainstorming procedure. When individuals 
working alone and in groups have been equated, those working alone have 
produced as many different ideas as those working in groups. 

The other side of brainstorming, the hypothesis that production of ideas 
goes better without criticism, is more plausible. Courses in creative problem 
solving, built around brainstorming, emphasize quantity rather than quality 
and advocate the postponement of judgment so as to free the thinker from 
the curse of premature self-criticism. The outcomes of brainstorming will be 
discussed systematically in a later section, along with ee a eats 
But, aside from the results of training, investigation of the re a ng 
quality and quantity of solutions is beginning to throw some ight on the 
processes by which the solutions are produced and on the conditions that 
affect these processes. ‘dere b 

As dependent variables, three measures must be a ered: the a er 
of solutions, the mean quality of these, and the number o! superior solutions. 


All solutions are rated by trained judges or scored objectively; those above an 
d high-quality or supertor products. These three mea- 
igh-qualit} 


arbitrary level are calle ; 
all see, but each may be an appropriate answer to a 


Sures overlap, as we sh 
different question. 


ONE SOLUTION OR MANY? s . , iv 
Experiments on problem solving typically require the subject to give one 


solution to a problem; experiments on productive nia anp, i a e 
> . $ P 7: 7 
tradition, require many solutions. A series of experiments by Johnson, Parrott, 


and Stratton (1968), directed toward analysis of am as WAN as — 
: i i ‘een writing one solu- 
cluded the basic comparison between writing 


»—and writing many. Since tasks as simple 
n criticized as trivial and vulnerable to 
tantial problems that could offer some 
ferred. The plot title problem—which 
r according to Wilson, Guilford, and 
] and four others of the same 
] trends as well as differ- 


ment of outcomes, in 
tion—ealled the standard condition 
as writing uses for a brick have bee 
Variations in instructions, More subs 
challenge to college students were pre! 
is a good test of the originality factor 0n 
Chilstenssy (1953)—was taken as a MO le 
type were constructed in the hope of finding genera 


ences between problems. 


plot of a story or movie, was presented 


Plot Ti ivi h 

itles. A paragraph, giving E i 
x K: . 4 it. 
With instructions to write alternative titles for 
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i ins i to 
i isti vas with instructions 
Table Titles. A table of agricultural statistics was presented w 
write titles for it. 


ial 
i iag senti 4 ires for socia 
Conclusions. A chart, with column diagrams representing expenditu 


i i ions th: ay be drawn 
welfare, was presented with instructions to write conclusions that may 
from the chart. 


i > printed with instruc- 
Sentences. Four words, happy, expensive, horse, lake, were printed with 
tions to write sentences containing these words. 


g g dai »moved from 
Cartoon Captions. A cartoon in four squares, with the printing remo 


ih 7 5 mite alternative: quotes for 
the last square, was presented with instructions to write alternative qu 
the last square [p. 4]. 


The solutions, about 1,000 for cach problem, were rated independently 
by 2 judges with close agreement, and those above the 90th percentile w 
considered superior solutions. 7 odieselisital 

As compared to the standard single-solution condition, the multiples sli 
condition resulted in two distinct differences: a decrease in the mOPN T sse 
of the solutions, and an increase in the number of superior solutions. T = 
results are not contradictory. Subjects instructed to write many soluti 
wrote about seven, on the average, of 
increase in quantity included an incr 
mediocre and inferior solutions. This 
of solutions is scrutinized, as in Fig. 
solutions can be 
made no refere 


. s the 
a wide range of quality, and sal es 
rease in superior solutions as we 


is obvious when the complete amare 
7.4. The increase in number of “ion 
attributed to a pure quantity effect, since the instructi 
nee to criticism, quality, or postponement of judgment. 


INTRAINDIVIDUAL VARIABILITY 


IN QUALITY OF SOLUTIONS 
A fundamental characteristic 


of productive thinking under multi 
tion instructions is the variation in quality of the solutions produced 

subject. It is conceivable that some subjects write mostly inferior so 
some mostly mediocre solutions 
were true, | 


ple-soltr 
by each 
Jutions, 
If this 


rin sehor ions. 
„and others mostly superior soluti eal 


the superior solutions would come predominantly from 
number of subjects; but this 
duction records in the experiment above. Almost every subject, wilt ] some 
mean quality of his solutions, wrote some solutions of higher quality and 5 


„tion 
a > productio! 
of lower quality than the group mean. About 2 percent of the pr o to dts 
records actually extended through the whole scale of quality, from 4 One 
shown in Fig. 


. ah a, 1 

7.4. The median range was about 7 points on this “ene ay 

important consequence of this intraindividual variability was that oe ae 
subject who was encouraged to w = 
of writing one or two superior ones. n nusual 
Although comparable production records for simpler tasks (such as u large 
uses) have not been made public panis snces 
intraindividual variabil ai 
will follow. 


ee tan] prO 
Ha g io rj ridual | 

assumption is contradicted by indiv ae ahi 

atever 


¥ è r 
rite many solutions had a goo! 


, it is a fair assumption that the 
ity will be observed, and that the same con 


VARIATIONS IN INSTRUCTIONS . 
The emphasis that the thinker puts on quality relative to quantit 

in an unknown degree, on previous training; but it is also valine 

instructions and working conditions. Meadow, Parnes, and Rees 


ds, 
depe” 
y is ð 
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Group 2 
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100 


Sop 


2-3 4-5 6-7 8-9 10-11 42-14 


Rating 
FIGURE 7.4. Complete distributions of ratings of solutions to 5 problems by 40 subjects 
who wrote one solution (Group 1) and 40 subjects who wrote many solutions (Group 2). 


The base line represents the sum of 2 independent ratings ona poe ah Odd and 5 
even sums have been combined to remove irregularities, except that the highest ratings, 12, 


13, and 14, have been placed in ] interval. From Johnson, Parrott, and Stratton (1968). 


y giving alternate sets to students during 
ys 

task was to produce unusual uses, 
tions were: “Brainstorm to your fullest ability; 
We are going to count only quantity on this 
Il” The other instructions were: 


examined the effect of instructions b 
a course in creative problem solving. The 
and the brainstorming instruc 
forget about quality entirely. 
test. |. . Quality is of no concern at a P i W 
“Forget all about brainstorming- Strive completely S quality. Pa e to 
see how many good ideas you can produce ina ah cg of time. You 
iss E -dez S ze 
öte voinena ba penalized for any bad ideas [p- 414].” When the responses were 
ire going to be penalized’ |’. identified, it was the group with the brain- 
rated and the superior solutions identihee, g 


3 : t. And this effect is not limited 
Storming instructions that had produced the most. An A : ia, 
= ‘lity, for brainstorming instructions improved the 
; 


dofa distribution of subjects (identified by scores 


to a restricted range of ab 
ell as the bottom third (Turner and Rains, 


Performance of the top thir 
on a pretest of creativity), aS W 
1965). 2 

i Weisskopf-Joelson and Eliseo (1961) als 
tions, but their method of analysi 
task was to think of attractive names for a 
chandise, which were then rated for <a 
instructions emphasizing quantity the quantity 


o compared two types of instruc- 
‘ferent conclusion. The subjects’ 
r a new deodorant, and other mer- 


increased, but most of the 
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increase was in the low-quality responses, not the high-quality ones. Klean 
quality was higher under instructions to be critical. ERE EE 
Specific instructions about solutions are not the only factor s aE and 
difference; the social atmosphere generated by the examiner's instrue yai 
demeanor also influence production. Without varying the monen s 
the solutions, Dentler and Mackler (1964) arranged a relaxed, safe td do 
dition (the examiner was friendly and said he knew the subjects ees Sal 
well on the test) for comparison with an “unsafe” condition point of 
was serious and distant while stressing the challenge and ome 
the test). As expected, more unusual uses were produced under 1g ae 
conditions. The results of the Hartocollis and Johnson (1956) Ee a 
tioned above suggest that alcohol dissolves the “unsafe” constraints a cians 
the subjects to produce ideas as if they were “safe.” Similar instructiona S ol 
have been obtained in a ninth-grade classroom with tests of unusual — 
consequences (Adams, 1968). Instructions emphasizing competition depr 
performance compared to that of 
ing freedom from compe 
formance. It is bec 


a control group, while instructions gee 
tition and open receptivity to ideas a a 
ause they appreciate these effects of an insecure on have 
atmosphere that careful experimenters, like Wallach and Kogan (1965), 
administered their tests in an informal, gamelike situation. , swe iboi 
The most complete investigation of the effects of specific cage ar 
solutions to be produced comes from Gerlach, Schutz, Baker, and a 
(1964), with six sets of instructions, including penalty for bad respi 


) x a ig 
. P f : “criteria-cued 
brainstorming, nonbrainstorming, and what they called “criteria-c 


y hat 
f s P sh was tl 
structions, The task was to list uses for a coat hanger. Their bonen ning 2 

A P A x : > tt oar o 
the Improvement attributed to brainstorming was really due to lea 


in- 
e in t MAR mea iteria-cued i 
criterion of quality, so they spelled out the criterion in the criteria ir score 

s a SO EDGY, Spe : é joher you 
structions: “The more imaginative or creative your ideas, the higher y valuable 
will be. Each idea w 


: : i wE v 
ill be scored in terms of how unique itis... how 
i 


Its 
the more original and creative the better [p. 80].” As the ee at 
turned out, the criteria-cued instructions did yield the largest FE itd 
superior responses, slightly a three 

The instruction rated 
tests of unusu 


more than the regular brainstorming inst | 
al effects revealed by Manske and Davis (1968) a 
al uses were even more clearly distinguished, since pas Feet 
responses on “practicality” as well as “originality” dimensions. When n less 
to “be original,” the subjects gave uses that were more original e more 
practical; when instructed to “be practical,” they gave uses that ue were 
practical but less original. However, they were unable to give see 1 strong 
both original and Practical. In fact, over all conditions there we eae 
negative correlation, — .80, between the ratings on originality and pra does 25 
dimensions. This negative correlation depends, of course, on the were 
well as the responses. But it does suggest that in these rather een situa 
situations the subjects trade off one dimension for another as, in othe 
tions, they trade off accuracy for speed. 

The experiment by Johnson, Parrott, and Stratton (1968) re 
effects of information about the criteria of good solutions with mo 
stantial problems. The information given was as follows. 


verified the 


sub" 
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Plot Titles. By clever we mean an imaginative, creative, or unusual title 


for this plot. 
Table Titles. A good title is a comprehensive one that includes the impor- 


tant points concisely. 
Conclusions. A good conclusion would be a valid generalization which 


integrates the chart as a whole. 
Sentences. A good sentence reads smoothly; the four words fit unobtrusively 


into the structure of the sentence. 

Cartoon Captions. A clever quote is an imaginative idea that fits the 

cartoon [p. 4]. 

ite evidence of the effects of criteria-cued instructions 

Those who were given the information about 
fewer solutions, but of higher mean 

ber of superior solutions than those who 

To be specific, 80 subjects with 

hich was 12 percent of their 


The results gave defin 
on solutions to the 5 problems. 
the criteria of good solutions wrote 
quality, and contributed a larger num 
did not have the benefits of this information. 
criteria cues wrote 281 superior solutions, w. 
total output of solutions; while 80 subjects without such cues wrote 226 
superior solutions, which was 8 percent of their output. 

Even with information about the criteria of good solutions, however, those 
who wrote many solutions did not produce solutions that were as good, in 
respect to mean ratings, as those who wrote only one. (They produced a 
larger number of superior solutions, of course, because of the quantity effect). 
But if criteria information improves the quality of solutions in the multiple- 
solution condition, why not let subjects in the single-solution condition have 
the benefit of criteria information also? An experiment (Johnson, 1968) which 
allowed 4 minutes to write one solution to a problem yielded saras of 
greater mean quality on three out of four problems for those who had 


criteria information (see Table 7.2). , , 

i i ‘periments on 
We must conclude that instructions are very important a i a 
productive thinking, as in experiments on flicker fusion, oper di g 
> relaxed conditions and 


P ; ly, 
and other performances by human me NoPE pa yet are instructions 
instructions not to evaluate are sometimes helpful; better y 


‘ iteria. Although some of 
these experim e designed to refute certain icy 
a ai ents were esign m G 
brainstormi I sitive side they demonstrate the intellectual flexibility 
orming, on the pos + ability to adjust their output to the 
of educated young adults, and their ability 


apparent demands of the situation. 


ucTIONS ON MEAN QUALITY OF SOLUTIONS TO 


ay 5 i 
ABLE 7.2. EFFECTS OF INSTR JECT WRITES ONE SOLUTION 


FOUR PROBLEMS WHEN EACH SUB. 
i ce Cartoons 
Instructions Plot Titles Conclusions Sentences 
7 4 6.0 
Standard 7.7 7.0 8 4 60 
Criteria 77 9.8 4 


From Johnson (1968). 
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PROBLEM DIFFERENCES IN TH. 
The relation between quality 


Qohnson, 1968) by having 


solutions with the benefit of 
and Sentences, w 


difference betw 


E QUALITY-QUANTITY RELAT = dail 
and quantity was investigated in iii ap 
different groups of subjects aad PERNE 
criteria information. Two problems, 


search the 
si M oe s researc 
ere chosen for this study because in previous 
‘cen single-solution 


‘ ean 
ione in if 
and multiple-solution pe Sentences: 
i a aries y or S 
quality of solutions had been large for Conclusions and small 
The data for groups 


a 
in Lig. 75, anda lop 
of 25 to 30 subjects are shown in Fig. 
arithmic curye 


g on of this 
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has been drawn to each set of points. The equate 
form that best fits the 
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Onclusions Y — 


f e ve between pro : the 
i decline in quality and the difference to assume a 0 
n rate of decline need explanation here. It is reasonable 

Subjects take instru 
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5 . » solution 4 e tv 
Cons to write more than one sol uld not be tof th 
Ji d . r 0 Pi 
ower their standards, but instructions to write four w‘ 


ŝ e 
ering 
a a 3 -Jowenna 
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number requested would probably be curvilinear. An explanation of the 
difference between the two curves must take account of the quantity and 
quality of available solutions. The number of good sentences that can be 
constructed from the four given words is almost unlimited, hence the decline 
antity would be small. The number of good conclusions to the chart 
hence the curve falls sharply with an increase in 
ho tried to write many conclusions frequently 
ect of the chart, and such fragmentary 
solutions got low ratings. The high mean quality of the single conclusions 
suggests that one good solution is easily constructed and differentiated from 


Thus the quality-quantity relation depends in part on 
vailable solutions as a function 


with qu y 
is distinctly limited, however, 
quantity requested. Those w 
wrote conclusions to only one asp 


others of lower quality. 
the shape of the frequency distribution of a 


of quality. 
Ncither problem had only one right answer, but on the continuum from 


divergent to convergent thinking Conclusions would be located toward the 
òo ` . : 

convergent and Sentences toward the divergent end. Thus this analysis offers 

a preliminary differentiation of solution processes at different locations on the 


convergent-divergent continuum. 


INDI S ’ F 
In ie iaie = wer many solutions, it is possible to examine the 
relation between individual differences in quantity and quality. es 
quantity is emphasized, some write more than others, probably because the 
instructions are somewhat ambiguous and the subject can choose E er = 
emphasize quantity or quality. Parnes and Meadow a aa i i 
tions of .64 and 81 between number of solutions written an number of high- 
t al. (1964) reported 


; ach, e 
quality solutions written. Parnes (1961) and Gens Se coe, an iS kaoh 
similar correlations for simple production tasks. Johnson, ; 


(1968) computed correlations between the quantity seme ~ pi gealty 
Scores, namely, mean quality and number of a dies sag ate 
groups on five problems. mec substantia pro To , miea Ta iroi 
all solutions showed consistent negative correlations wit a cea 
Wrote more solutions wrote solutions of lower mean qua om $ mE 
ing correlations between quantity and number of superior 


Zero š 
; ations described 
Stich: corrélations are probably secondary effects of the = K PEES 
above, Those who take the instructions t° acne ie If enay is 
= . f 1S = 
of A i ir mean quality ‘ 
quality act accordingly and thus the t this increase 


3 ions is increased; bu 
em . solutions is Inc ; > 
phasized, the number of superar a afcient, in the case of substantial 

1 


a quantity effect apparently jon in mean quality. 
Problems, to overcome the general reduction á 


MENTS 


REORGANIZATION OF pe s it does in the solution of 


; inking & 
a Reorganization aeeie ia divenn, Le of convergent thinking. 
n: a i othe ~ k , 
agrams, arithmetic problems, an! ig OF short phrases, the production 
Wi = 


i V 
hen the task is to produce single 
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PROBLEM DIFFERENCES IN THE QUALITY-QUANTITY RELATION letail 

The relation between quality and quantity was investigated in more deta 7 
(Johnson, 1968) by having different groups of subjects write 1, 2, 4, py : 
solutions with the benefit of criteria information. Two problems, sone ng 
and Sentences, were chosen for this study because in previous research oc 
difference between single-solution and multiple-solution conditions in — 
quality of solutions had been large for Conclusions and small for aie 
The data for groups of 25 to 30 subjects are shown in Fig. 7.5, and a a 
arithmic curve has been drawn to each set of points. The equation of i i 
form that best fits the data for Sentences is Y = 8.3 — 1.60 log X, and fo 
Conclusions Y = 9.4 — 4.78 log X. p 

Both the general decline in quality and the difference between pro ie 
in rate of decline need explanation here. It is reasonable to assume that 
subjects take instructions to write more than one solution as permission A 
lower their standards, but instructions to write four would not be twice 4 


has ‘ i è a A of the 
permissive as instructions to write two. The standard-lowering effect 


Sentences 
ee 
“= 


Mean quality 
N 


Conclusions 


1 2 3 4 5 6 
Quantity requested 
FIGURE 7.5. 
subjects were re 
decline in mean 


30 


Quality and quantity of solutions to 2 problems. Groups of 25 to e the 


4 ome crib 
quested to write 1, 2, 4, or 6 solutions. Logarithmic curves des 
quality as quantity increases. From Johnson (1968). 
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number requested would probably be curvilinear. An explanation of the 
difference between the two curves must take account of the quantity and 
quality of available solutions. The number of good sentences that can be 
constructed from the four given words is almost unlimited, hence the decline 
with quantity would be small. The number of good conclusions to the chart 
is distinctly limited, however, hence the curve falls sharply with an increase in 
quantity requested. Those who tried to write many conclusions frequently 
Wrote conclusions to only one aspect of the chart, and such fragmentary 
solutions got low ratings. The high mean quality of the single conclusions 
sts that one good solution is easily constructed and differentiated from 

uantity relation depends in part on 
f available solutions as a function 


sugge 
others of lower quality. Thus the quality-q 
the shape of the frequency distribution o 


of quality. , 
Neither problem had only one right answer, but on the continuum from 
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civ J thinking Conclusions would be 
a d he divergent end. Thus this analysis offers 
convergent and Sentences towar the g a be. fa 
a preliminary differentiation of solution processes at different locations on the 


convergent-divergent continuum. 
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relation between individual differences A je robah cen 
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instructions are somewhat ambiguous na Moke (1959) reported correla- 
ae hacia quantity or quality. Pat solutions written and number of high. 
ions of .64 and .81 between number ach, et al. (1964) reporte: 
quality solutions written. Parnes (1961) ae Parrott, and sare 
Similar correlations for simple production tas ae score and two quality 
C1968) computed correlations n superior solutions, for e 
Scores, namely, mean quality an i ial problems, mean quality o! 
Sroups on five Agila On these more one quantity; those who 
all solutions showed consistent negative cone “ames quality. The correspond- 
Wrote more solutions wrote solutions of lower > 
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process can easily be conceived as the retrieval of previously learned informa- 
tion, perhaps even as the sampling of a specified class of words. When the 
task is more complex, as in writing a sentence or paragraph, such a simple 
conception is less plausible. Previously learned elements reappear, of course, 
but not typically in the form in which they were originally learned. 

To see what happens to information acquired immediately before a produc- 
tion task, Maier, Julius, and Thurber (1967) gave college students 12 pairs 
of words to memorize, then asked them to write a story about a prisoner, using 
as many of the words as possible. Later, they were asked to write a second 
story on a topic of their own choosing. Under these conditions, pairs of words 
that fit together and could easily be used in proximity, such as damp-weather 
and hot-soup, often reappeared as pairs in the stories. Pairs of words that 
could not easily be used in proximity, such as steal-prison, seldom appeared w 
the original form as word pairs, but single-word fragments of the original pa 
often appeared alone. Many of the words from both types of word pairs 
appeared in the stories paired with other words as new pairs (e.g a 
prison). As to individual differences, the subjects tended to be somewhat 
consistent from the first story to the second in frequency of use of old parts 
of single words, or of new pairs. f 

Stratton (1970) applied similar methods to study of the Sentences problem 
described earlier, which requires the construction of sentences from the wor : 
happy, expensive, horse, and lake. A group of subjects who had memorized 
sentences that included the pairs happy horse and expensive lake wrote Sen 
tences that contained these clichés and thus got lower quality ratings ee 
the control subjects. Another group of subjects who had memorized sent 
that included several unusual meanings for horse and lake wrote sentenci 
that reorganized these nouns into more unusual constructions, such as 
around and Happy Lake Resort, and thus got higher-quality ratings: -on 

Although creative thinking has often been viewed as the reorganizat 
of experience, empirical investigation from this point of view is not nee 
Reorganization will not show up unless the methods permit it; the experimen” 
above, together with others now being used in the study of verbal ae 
ey disclose how words acquired just before a production task are, Of cap thet 
reorganized. But study of the reorganization of information acquired at ot 


. . . pua 
times in other places in the more remote past for use in the present produ 
task will be more difficult. 


PRODUCTION SEQUENCES 


. pce 
When someone writes many 5 


: solutions to a problem, a record of the sé aise 
of solutions offers an opportunity for testing hypotheses about the proc er 
by which they are produced. It is anabe te suppose that the er a 
produces his best solution first, then his next best, and so on, in descen ™ e 
order of quality. The opposite sequence also seems reasonable: the Sa ma i 
come later. The number of solutions and their quality must be consid? 

as well as the complexity of the task. í 


we 
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SIMPLE RESPONSES 
Some years ago, Bousfield and Sedgewick (1944) published cumulative 


curves—two of which appear in Fig. 7.6—showing that the production of 
single words of the same class decreases with time. When the number of 
a period of 10 minutes or so is plotted cumulatively, the 
production curve can be represented by an exponential equation, n = 
eC] — emt), in which n is the cumulative frequency, c represents an upper 
limit, e is the base of the natural logarithms, m is the rate of approach to the 
limit, and ¢ is the elapsed time. 

The general shape of the production curve has been confirmed by other 
experiments, but there is some question about the reason for the decrease in 
production rate. Bousfield and Sedgewick suggested that the upper limit 
represents a supply of words which is sampled without replacement. As the 
supply is depleted, the production rate decreases. Indow and Togano (1970) 
described one scanning model for scanning all items in the supply in order 
to reject those already produced and a second model that moves the scanning 
process to regions of lower density. The upper limit implied by these models, 
and by extrapolation of the empirical curves, cannot, however, represent the 
total supply of words in the subject's vocabulary but more likely the supply 
under specific experimental conditions. When Merrill (1950) asked college 
students to write words beginning with S, the above equation, applied to the 
data of one subject, yielded a value for c of 132. But 12 percent of the words 
in the dictionary begin with S, so if 132 is 12 percent of this aa 
vocabulary, his vocabulary would be 1,095 words, an absurd value. Furt a 
more, this equation gave larger estimates for words beginning with ha cee 
constitute only about 2 percent of the dictionary, than for words beginning 


words emitted over 


Associates 


Time in min. 


ked, in one 
FIGURE i duction of words. College students were asked, in o 
PAR tl pot ee Ties of fellow students, and in te ahea rak of 
animal names with two or more syllables. From Bousfield and Sedgewic ¢ 3 
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with S, which constitute 12 percent. There must be some task-related limit 
on the availability of the words, perhaps an associative interference whereby 
some words get in the way of others during copious production. Another 
possibility is a general work decrement or simple boredom with the task. When 
increments per minute are plotted, the curve looks like a fatigue curve: 4 
short rest is followed by partial recovery (Hall, 1960). All in all, if we ask 
about the number of simple responses produced in a short time, the hypothesis 
of sampling without replacement is still tenable, but there are questions 
about the class of responses sampled and about performance factors. 

Turning now to the quality of the responses, it is clear that the production 
of simple responses is characterized by the early appearance of the more 
common ones. Bousfield and Barclay (1950) found that words of high 
frequency of usage were produced early. Christensen, Guilford, and Wilson 
(1957), reversing the proposition and calling uncommon responses creative, 
found that more creative responses were produced late: more unusual uses O 
a pencil or a button, more uncommon figure concepts, more uncommon 
number associations. The occurrence of more unusual uses late in the produc: 
tion sequence has been confirmed for 5-minute and 15-minute periods (Parnes, 
1961) and for 10-minute periods (Manske and Davis, 1968). 

Such results match the hypothesis of a response hierarchy, derived from 
association theory. For convergent thinking (as in the anagram problems 
of Chapter 4), the hypothesis is stated in espact to problem difficulty: for 
divergent thinking with multiple solutions, it is stated in respect to order: 
common responses occur first, followed by the more uncommon response 
Ana the uncommon responses are of higher quality, on one dimension ° 
creativity, than the common ones. f 


COMPLEX RESPONSES 

The production of complex solutions to problems differs in som 
from the production of simple responses. A complex response, proc 
demand, resembles a multidimensional class concept in that the spec 
of the product are analogous to the attributes of the concept. Instea 
requesting names of birds, we can request two-syllable names of birds ~ 
conjunctive class), and college students can produce these quite reac 
College students can readily produce relational classes also, such as pa's A 
words that are synonyms, ‘or antonyms, or books and their authors: iit 
attributes may, of course, be abstract dimensions: “Name women who ie 
intelligent as well as virtuous.” It is probable also that many thinkers supp g 
ment the specifications of the task with a few of their own, such as pewa 
conformity, or avoidance of the obvious. Although some of the lement 
complex responses may be drawn from a reservo of experience—a"4 = 
portion of the operation resembles recall—a large portion comes fram” 
organization of the elements, from integration, construction, or in 

Thus complex solutions usually wary on several dimensions pert 
creative achievement—such as cleverness, originality, and appropria 
as well as on statistical uncommonness. If we speculate about a ca sS 
which the solutions are drawn, it must be smaller than a compara 
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of simple responses. The number of solutions produced is usually less over 
equal periods of time. As a specific illustration, in the Manske and Davis 


(1968) experiment, instructions to be original led to the production of 11.4 


uses, and instructions to be practical to 18.4 uses; but instruction to be original 


and practical led to 10.2 uses. In contrast with the decreasing production 
nses, the curves exhibited by Christensen, Guilford, 


curves for simple respo 
impossibilities are 


and Wilson (1957) for writing plot titles and listing 
linear over working periods of 12 minutes. The slow initial rate of produc- 
the small supply of suitable words. The linearity of the 
production curve could be due to the weakness of the factors hypothesized 
g rate of production. When solutions are complex, avoidance 
e task less boring. 


tion is caused by 


for a decreasi 
of repetition is less difficult and th 
As to the quality of the solutions to complex problems, the number of 
superior solutions increases, in agreement with a response hierarchy inter- 
pretation, for some problems but not for others (Christensen, Guilford, and 
Wilson, 1957). Similar inconsistencies appeared in the results for the 5 
problems studied by Johnson, Parrott, and Stratton (1968), but much of this 
inconsistency has been cleared up by Stratton’s reanalysis of the data, shown 
in Table 7.3. In this experiment, 40 college students were instructed simply 
7 minutes. Another group of 40 had instructions 
idge these solutions and select their best. 
Others were given the criteria of good solutions, and others again had both 
if and the request to select their best. As to number of 
superior solutions, the totals for all problems show that the largest dif- 
halves occurred under the simplest condition. 


ference between frst and last geam i z hee diet 
When judgment was requested or criteria information given, the ditrerence 
8 


was smaller: and when both complications were added, the difference dis- 
eS A . é " 
appeared. It is noteworthy also that the increase under simple instructions, 


to write many solutions in 
to write many solutions, then to Ju 


the criteria information 


TABLE 7.3. nuatpen OF SUPERIOR SOLUTIONS IN FIRST AND LAST HALVES 


OF PRODUCTION SEQUENCE FOR 
UNDER FOUR PRODUCTION CONDITI 


FIVE PROBLEMS 
ONS 


Sentences Cartoons Totals 


Plot Titles Table Titles Conclusions 


Conditions First Last First Last 


Half Half Half Half H 


First Last 


First Last First Last First Last 
alf Half Half Half Half Half Half Half 


Many solutions 9 19 4 5 7 9 6 16 7 B 33 62 
Man ; 
y solutions 
with judgment 12 12 7 4 6 11 11 13 19 19 55 59 
Many soluti 
Aor 
Wihges a g roug 13 17 12 14 55 6 
Many solutions 
with jud. 
and a lo 16 7 9 17 Io 13 18 al R 68 65 
Totaly osgan ae 43 64 59 58 211 251 
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from 33 to 62, did not bring this group up to the initial lev el . pi ea 
solutions produced under the more complicated instructions. O by spa ee 
more specific instructions increased the number of superior solutions | 
at the beginning, after which improvement was small. ous uae T 
A later study by Stratton (1970) analyzed production sequences ro) e 
Sentences problem in more detail, for sequences of five a it 
solutions. Mean quality increased over five solutions with anc i ; im 
criteria information, but over sequences of ten solutions mean qua n a 
creased only without criteria information. Stratton went further and ee 
the separate dimensions of the solutions as they varied over a i did 
sequences. The length of the sentences increased, and in genera purer 
the number of times the four given words were used with unusual mi 8 
The number of times that the given words were paired in a qantas 
expensive horse) declined over a sequence of five solutions, but, a 1 ‘not 
some other dimensions, such as the complexity of the sentences, E bi 
change as expected. When the solutions to be produced are complex, meer 
not reasonable to expect that all the dimensions will change in the sa 
way, and this analysis shows that in fact they did not. seas. qyere 
In Table 7.3 the differences between problems, over all —, oe 
of the same order of magnitude as the differences between first at i: 
halves. For Sentences and Plot Titles, the increase was large; for Table - i o 
Conclusions, and Cartoon Captions, it was negligible. The compariso A 
Sentences and Conclusions is particularly enlightening because phei ae 
problems have already been compared in respect to the quest Con- 
tion. Fig. 7.5 displayed how the mean quality of the solutions to “lutions 
clusions problem decreased sharply as a function of the quantity of a d this 
requested while the mean quality of Sentences decreased slightly. / a to 
difference was apparently due to the smaller number of good ane Table 
the Conclusions problem. Putting these data together with those T to last 
7.3, we can say that for any problem the improvement from frst a aë 
half of a production sequence is related to the number of potential sul 
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Tn general, two factors determine the difference between the imit 
pa 5 i pr ; 
solutions in the first and last halves of a production sequence. Img uality i” 
is maximal when the instructions are general and therefore the quan” 
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the first half is low. Improvement is maximal for problems with man) 
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zag, almos 

4 i 4 ro et10ons, « : 
solutions. Thus in the case of Sentences with general instructie half as iP 
three times as many superior solutions were produced in the last he cific 


the first. In the case of 


i ae der spe 
Conclusions and Cartoon Captions un 
Instructions, almost tw 


s 5 g alf. 
ice as many were produced in the first ha 


Te TOTTE AND DISCONTINUOUS PRODUCTION 
I 


-kor has strugs 
ne concept of incubation implies that after the thinker ha him late 
i . es tO 
with a problem and then abandoned it, the solution often comes t 
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while he is attending to other things. Many eminent writers, scientists, 
and inventors have testified that such sequences of events have happened 
to them, and psychologists have tried to explain this phenomenon by hypothe- 
ses about unconscious work and the reduction of interferences (see Johnson, 
1955). Such matters are difficult to study objectively, but two experiments 
have manipulated the most obvious variable by comparing output dur- 
ing a continuous working period with output during discontinuous periods 


of the same total time. 
In one experiment of this design, Gall and Mendelsohn (1967) gave 


college students 2 minutes to work as many items of the Remote Associates 
Test as they could. Then each subject was given 25 minutes to do five 
problems that he had been unable to solve, but half the subjects first had 
a period of 25 minutes of unrelated activity which was supposed to facilitate 
incubation. Most thinkers who have claimed benefits from incubation refer 
to concentration on a single problem, so this may not be a fair test; in any 
event, those who worked continuously did considerably better than those 
who worked discontinuously. On the other hand, when Fulgosi and Guilford 
(1968) had college students produce solutions to two items of the Con- 
sequences Test, they found a small increase in number of immediate con- 
sequences following a 10-minute interruption (with no interpolated activity) 
20-minute interruption. There was no 
increase in the number of remote consequences; in other words, there was 
an improvement in quantity but not in quality. So far as objective ogy a 
from these rather superficial experiments is concerned, the status of in- 
cubation therefore remains uncertain. 


and a large increase following a 


JUDGMENT OF THE SOLUTIONS 
solutions to a problem, the question of 
identification of the best arises. There is no advantage p —_ a 
superior solution unless it is identified as such; like maeng p othes without 
trying them on, producing solutions without geome ee es = ja 
finished performance. For experimental purposes, superior a r on ó 
identified by the experimenter, put surely this is an z ohe — <a 
erally speaking, the thinker selects the one eee zhe w a - 
mission to a journal, a museum, OF 3 piene ones tity ie the ebe 
use. We have seen that instructions emphasizing ae a y op mans er 
of superior solutions, but increase the number of oer Sense te om 
If the thinker chooses at random among the A te the pil en pE : 
Proper measure of overall performance <a e sa one l A as 

this is likely to be lowered by the quantity emp. pais loan rs fy inker 
does not choose at random. Hence we must Bee my Ae igment a 
the solutions as a separate process that makes its own contribution to the 


Overall enterprise. 

A small-scale study (Johns 
college students could judge $° 
own solutions and those written 


When anyone produces many 


on and Jennings, 1963) demonstrated that 
lutions to the Plot Title problem—both their 
by others —with fair accuracy when their 
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choices were compared with the choices of expert judges. In respect to 
individual differences in the time spent on each process, judgment and 
production were somewhat independent. Johnson, Parrott, and Stratton 
(1968) therefore examined this judgment operation more thoroughly by 
requesting some subjects in the experiment described above to select their 
best solutions to five problems. Such a request is feasible because of the 
large intraindividual variability in quality of solutions produced by each 
individual. In general, the solutions preferred by the subjects as their best 
were better than the others; when given the criteria of good solutions, the 
difference between their preferred and nonpreferred solutions was even 
larger. Nevertheless, their judgments were not as good as they might be. For 
example, a subject who wrote seven solutions might pick the best of the seven, 
as measured by the ratings of expert judges, but he might pick the second best 
instead, or the third best. The third best would be Letter than the average 
of the other six but not much better. Hence, when the subjects who wrote 
many solutions were given the criteria of good solutions and asked to select 
their best solutions to each problem, their prefered solutions were about the 
same, in mean quality and number of superior solutions, as those of the 
subjects who wrote only one solution. In fact, when the proper comparisons 
are made, it appears that the standard condition of asking the subject to write 
OnE solution to a problem is a hard one to beat. Such findings indicate that 
multiple-solution experiments, which ignore the judgment process and do 


not i i ; ae G 
t include a single-solution condition as a baseline, are incomplete. They 
suggest, furthermore, th utions 


might be worthwhile, 


The introduction to this chapter made passing reference to the dependence 
w Fe ey creative achievement on favarable intellectual conditions 
ie a ii Similarly, productive achievement in the classroom depends pa 

vel achers recognition of a student's achievement, The student whose 
ma i aa the teacher’s criterion of productive achievemen 
direction. And in the ; aloe pr are meer. a subject 
eenen odefa ae = ihe psychology lemalo Mai aggin 
the fortunate one. It i w h Poum Aana hy nereng p d Aid 
more judges who h 1s true that careful experimenters have used me? 
nidoe 8 5 ave worked together until they reach satisfactory n A 
i : Sireementy but this is not the end of the matter. If the solutions me 
the flee esters a to ask which dimensions of the solutions deter! 

N Ce a ra dhe outcome of the experiment. gilt of 
judgments of solutions ode tae A His Jand by paa for ex 
ample, the overall rating of mele ates, [rolemi in a od 65 with 
number of unusual cnet a i the sentences was ape E = with 
number of ti i g ached to the four given words, -*? e 
irade ebi imes the given nouns were used other than as the aben ais 
fe am see si 26 with sentence length. This e erlne 
he j 3 prove that judges are human, but to Ur% eis 
the Tmpartance of the judgment process and of the match between the dim 
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sions that the producer emphasizes and those that the judges—whether 
literary critics, museum directors, or editors of scientific journals—emphasize. 


PRODUCTIVE THINKING IN THE PSYCHOLOGY CLASSROOM 

Most research on productive thinking has avoided problems from any 
specific course of study, presumably for methodological reasons, but the 
introduction of productive thinking into a college course is not at all dif- 
ficult. At Michigan State University problems have been constructed in 
formats similar to those described above and in content that samples standard 
introductory psychology textbooks. The instructions and three examples are 
printed below. Reliability of scoring is indicated by the correlation between 
the ratings of two judges. 

This part of the examination emphasizes creative thinking. You can use 

the knowledge you have acquired from the textbook and the lectures but 

you should go beyond this to produce an original solution to each problem. 


Short answers, never more than one sentence. 
Suppose television had been invented and had become popular before books. 


What would be the consequences for the learning processes of children? 
Cr = .88) 
Suppose you want to train an animal in a new, unique way. Invent a 
reinforcement schedule different from those used by others. (r = .93) 
Write an original sentence that inc 
motivation. (r = .87) 


Judes the words: easy, performance, 


rmance on these tests is influenced 


It will be interesting to see how perfo ; 
and teaching procedure. 


by variations in instructions, classroom atmosphere, 


Training for Productive Thinking 
Library shelves abound in books telling us how to train our minds to 
K agraphs of good advice in all of 


think creatively, and there are some par na ~~ 

them. The training procedures that have been subjected to empirical evalua- 
. e ò j 

tion are of most interest here, of course. Such procedures have been developed 


around hypotheses about productive thinking, and the outcomes of evalua- 
tion studies are relevant to these hypotheses. 


BRAINSTORMING 


Training in creative problem sol 


of Osbor : d on on 
n (1953), is grounded on š ; 
enrichment of idea in group interaction and another about increasing the 


quality of ideas by postponing criticism and emphasizing quantity. The 
evidence related to the second hypothesis was reviewed earlier; it is time 


now to estimate the overall value of such training. Np 
AS a check on qiieomes Meadow and Parnes (1959) administered several 
, 


tests of the Guilford creative thinking type before and after a 30-hour 
Course in creative problem solving and before and after unrelated courses. 


the brainstorming techniques 


e hypothesis about the exchange and 


ving, by 
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They found increases in quantity of production of unusual ia pak 
titles, as one might expect; but they also found increases a hig ipp 
products, as indicated by scores for uniqueness and value o! an cs 
three tests of unusual uses. The difference on the plot title test aia ie 
high quality was small. The tests were somewhat like the practice eC 
but not identical; the authors concluded that the course in creative pro 
solving had been effective. < (1959) 
Somewhat similar results were obtained by Parnes and Meadow ( A 
comparing performance on unusual uses by subjects with and hn 
course in creative problem solving. The most convincing results came oe 
a follow-up study (Parnes and Meadow, 1960) eight months after a “re 
in creative problem solving. As compared with control groups marches d 
verbal ability, the trained subjects showed improvement on all tests, inc! 
ing plot titles rated for high quality. aiii 
Even a brief training period seems to be effective, at least tempora kA 
Comparing individual production before and after 10 minutes ob ee 
training, Lindgren and Lindgren -(1965a) found an improvement 1N nd 
creativity level of captions written for cartoons by California students 4 
also (1965b) by Lebanese students. The same 
a gain in ratings of sketches for 
storming (Lindgren, 


type of experiment a abe 
a postage stamp after 10 minutes of sc 
1967). In these three studies, the critical ar tan 
the mean quality of all productions, not the number of superior pe 
Hence the gain could not be merely a consequence of an increase in sear! 
The emphasis of brainstorming on quantity instead of quality is a g¢? nits 
content-free technique, intended to aid the production of ideas on any eae 
It is also a negative technique, intended to remove barriers that a 
the thinker from making fluent use of the ideas he already has. These = 
features limit its effectiveness for more specific positive training pee ae 
As distinct from the immediate effects of instructions, the gains thät nde 
been reported are rather small and hard to interpret. On the positive $ 


P 5 i: s practice 
however, there is the possibility, noted by several investigators, thak 5 ition 
= K a ag i . dU 
with this novel techinque stimulates an enduring interest in the pro when 
of ideas, Brainstormi 


5 a5 and 
ae ng is more fun than most parlor games ar a w 
dignified by an organization’s official st 5 


can 
respectable within the org ; 
claim legitimate status a 
just be dismissed as 


# akeg tho 
amp of approval, it makes ie 
aie : ` ; min 
anization as action, Certainly, — cannot 
S a promising psychological technique an 
a Madison Avenue merchandising stunt. 
ASSOCIATIVE TRAINING me 
ý . e sar 
"We are assuming that originality can be learned and that me p% 
principles of conditioning hold as in other forms of operant behavi jatively 
ETET a siie - cc 5i i 
By originality, or original thinking, we mean behavior that occurs Ta those 
. . i i r 5 
infrequently, is uncommon under given conditions, and is relevant : 


conditions.” This quotation from Maltzman, Simon, Raskin, and 


l + productiv® 
(1960, p. 1) outlines the basis for another attempt to improve prec 
thinking. 


us 
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One experiment (Maltzman, Bogartz, and Breger, 1958) began with 
ordinary word associations: 25 words, like pencil, to which the subject gave 
another word in response. Then the subject went through the list again 
5 times altogether, with instructions to give different responses, after which 
f 25 words. In comparison with a control 
group, the training groups gave somewhat more unusual responses. Ambigu- 
ous results were obtained from a test of unusual uses for a button, included 
as a test of transfer, The instructions were varied also, and those who were 
instructed to be original on the test list gave more unusual responses. In 
fact, the difference due to instructions was about 5 times as large as the 


difference due to training. 

Since these results were promis 
Raskin, and Licht (1960) undertook an extensive series of further experi- 
ments along the same line, varying the conditions to tease out the important 
factors in this associative approach to originality training, but depending on 
the same two tests: uncommonness of word associations and unusual uses. 
The best training was to give one list of stimulus words twice and ask for 
different response words the second time. This procedure increased both un- 
common word associations and originality of uses. Repeating the task five 
or ten times increased uncommonness of the word associations but not origin- 
ality of the test of unusual uses. ae 

So far, associative training looks promising, but some criticisms of the 
results and interpretations have arisen. When Gallup (1963) tried to 
duplicate these results, he found improvement from one test to another 2 
all groups, even in those that had unrelated training 1n arithmetic, for 
example. Hence he suggested that the results of the training might be ii 
to self-instructions or to a shift in the set for such tasks rather than to the 


he was tested on another list o 


ing but inconclusive, Maltzman, Simon, 


associative training itself 
aining itself. wk ; 
The criticism of Caron, Unger, and Parloff (1963) dealt with | he SCONE 
association training with a strict criterion 
of relevance of the responses and got no improvement at all Poy ei 
Concluded that the associative training “may enrich one’s fantasy lite but 
without benefi i ity of [p. 422].” 
nefit to ingenuity of output LP- ; 

t=) J Š n 

Some of these discrepancies have been partly caie by bn ham 
Arenson, and Panman (1964). In their experiment, the effects of instructions 


to be otiginal were small and temporary; but training in word sain 
With six lists, following Maltzman’s procedure, did increase oe 
Ohi the istia] ages fest, Since common responses were increased, as well 
as unique responses, they stated that the increase in ji are = 
a consquence of the general increase in prodici > valun erpretation that 
is consonant with the general quantity e ect mentioned above. "m 

The Remote Associates Test can also be used aso CSEE S Ene anene 
of originality training. Maltzman, Belloni, and Fishbein (1964) tried out 
their associative procedure on this criterion but were successful only when 
the stimulus words, or words closely associated with them, also appeared on 


the test. They concluded that “performance on the problems employed which 


standards. They repeated the word 
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have only one correct solution can not be facilitated by the oo 
ing p np successfully employed with tasks having no one 

oe li la shown that those who obtain high scores on the Remote 
Associates Test produce more words in continuous production, Eraediman 
C1965) tried a somewhat different kind of practice. Hle encouraged aberey 
of production simply by having the subjects practice giving rapid associations 
to ten stimulus words. Such practice did iner 
Associates Test, althou 


se scores on the Remote 
gh equivalent practice in defining words did not. 
Apparently the demand for greater productivi 
greater productivity on the transfer task, 
ondary effect of productivity. 


during training leads to 
with increased originality as a sec- 
Ray (1966) also has suggested, from the results 
of a training experiment, that it was the subjects’ expectancy of many solutions 
rather than the training that increased the number of solutions. 

The original assumption about the effects of 
been entirely negated by later experiments, 
practiced words and closely re 


associative training has not 
More associations with the 
lated words apparently are acquired. But this 
rather narrow effect must be evaluated in relation to two more gencral 
effects: the establishment of an adaptive set for such tasks as a result of 
instructions and self-instructions, and the general 
which, depending on the scoring stand 
quality responses, 


increase in productivity 
ards, can include an increase in high- 


TRAINING FOR THE 
The influence 
ing of another p 


laboratory and classroom experime 
Levy C1968) has pushed the 


fs We ieee 
Y Proposing “that originality 


ROLE OF ORIGINAL THINKER 
of instructions from one 


Person on the productive think- 
erson has already bee 


n well documented here. In many 
nts this has been the strongest influence. 
interpersonal interpretation a step farther 
is better conceived as a form of role-defined 
behavior than as a form of Operant behavior, and that the effects of originality 
training may be understood as due to changes in the person’s criterion for 
an appropriate response rather than to increases in the associative strength 
of original responses [p. 72].” 

To compare these differe 
college students to take on 
Mr. Cor Miss) S.J. One 
structions to respond wit 
and the experimenter 
something similar after 


nt conceptions of origin 
the characteristics of an 
group was give 
h the first wor 
gave verbal 


ality training, Levy asked 
other person's personality, 
n a word association test, with in- 
d that would come to §.J.’s mind, 
reinforcement by saying “good” or 
each uncommon response. Each subject of another 
group was shown a list of uncommon responses attributed to a role model, 
S.J., and told to study it in order to be able to put himself in S.J.’s role- 
A third group was given a descripti 


ption of S.J. and instructions to play 
personality portrays him as creative, 
and interesting ideas [p. 74].” 
the role instructions. 


or novel way; his whole 
Imaginative, and always full of new 


A fourth had both 


the reinforcement and 


EO COLOLLLL << ee 


w 
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All groups improved in the number of uncommon responses after training, 
but the role-instruction training was slightly more effective than the reinforce- 
ment training or the role-model training. The combination of reinforcement 
and instructions was the best, of course. Transfer effects were tested by giv- 
ing cach subject the Holtzman inkblots, with printed alternative responses, 
choice of the uncommon responses being the measure of originality. All train- 
'N groups registered some transfer compared to a control group, but on this 
= also the role-instruction training was more effective 

ý 


than reinforcement 


tole-mode! training. 
i The unusual feature of this experiment was that 
subjects were asked to fill out rating scales to describe their impression of S.J. 
with role instructions rated S.J. as original, but 
ation that the subjects in the reinforcement 
8foup made the obvious inferences and also rated S.J. as original. Such 
reslis support the suspicions of many experimenters that verbal reinforce- 
aral p i similar a ange aie is na limited to strengthening the 
i : has the cognitive effect of informing the subject also of their 
*Ppropriateness. According to Levy, the instructions provide the subjects 
ie lel definition of their role; thereby suzy sina Pe a = 
iba judging response appropriateness. $ ñen the reinforcement makes 
e role more specific. 
' lt will be difficult to establish that the subjects were adopting the role of 
lh nig mamian t oe O aaen a 
. Actually, the rolemodel training was the le 
tee, Procedures, But much of the evidence on instructions, criteria sania 
» And associative training in this chapter and on set in Chapter 4 requires 
Swen that takes account of the personal eS en 
inate and subject. Levy believes that his ogniwa $ a nic toe 
tk a ation” permits more explicit definitions and more precise exp ntal 
pul generalizations that are neces 
effects of originality training. 


after the training all 


It is ae 
ii 'S not surprising that those 
is criti E s 
critical to the role interpret 


an 


sary f, ations than the complex mediated 
vely € . i a . 
+ ‘0r an associative interpretation of the 


TRAINING IN PRODUCTION STRATEGIES 
me ns lyrical descriptions of creative thinking eae gf degani 
Plined activity, to be left free from control so that fervent 1de 
a up from the depths of the unconscious. Others conceive | it as a 
See al Systematic process and thus generate ae peparap i 
on Ctions to “stop and think” are generally beneficia ; E so mt ip 
in whether communicated by instruction from others i adopted A 

struction by the thinker. Specific information about the criteria tor 


goo f 
solutions also improves performance. 


S to worki i spically 

and g; ng methods, artists are typ:cè à ae 

is : T Eindhoven and Vinacke, 
A952 play more control over their creative efforts (Ein 


f - Writers also speak of control over their medium; certainly, they 
hans Write novels or poems one word at a time, by continuous association. 
m what distinguished poets tell us (Arnheim, Auden, Shapiro, and Stauf- 


tation 


more systematic than nonattists 
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fer, 1948), the initial ideas may come from anywhere, perhaps by te 
tion, but soon a structure is outlined, at least tentatively, and then f ed p 
with the right words. Hence it is not unreasonable to design propig Si 
explicit recommendations about strategies for divergent as well as converge 

thinking. DETA 

Since the phenomenon of incubation has been reported by many success 
ful thinkers, and disregarding for now the uncertain evidence from one-shot 
experiments, an alternation strategy can be proposed. The hypothes ; sug: 
gested by the testimony of eminent thinkers is that, when one is stuck on a 
difficult problem, it is better to stop a while and then return to it, rather 
than plod on without interruption. This does not exclude the possibility of 
going over the facts or of daydreaming about the solution during nonwork 
periods because reports of incubation have explicitly referred to such activities. 
Nor is it stated in terms of efficiency per unit of time because productive 
thinkers seldom mention this measure. 

Many training programs have been designed for advertising men and for 
engineers engaged in research and development (Allen, 1962: Arnold, 1962: 
Gordon, 1961; Lincoln, 1962). Typically, these begin with a few inspiring 
anecdotes, then recommend an open mind, sensitivity to problems, and other 
general requirements for effective thinking. The specific aids often consist of 
checklists—or even matrices and more elaborate diagrams—to facilitate a 
systematic approach to the development of new products and projects. One 
such list, abridged by Arnold from the book by Osborn (1953), consists 


simply of verbs: Adapt, Modify, Magnify, Minify, Substitute, Re: 
Reverse, Combine, Helpful exam 


actions listed are also supplied. 


range, 
ples of improvements coming from the 


LABORATORY EVALUATION WITH 
To evaluate such procedures, Ridley 
a booklet of Strategies for the unusu 


following suggestions. “Transform the 


What uses of your object do these tr 
illustration is a brick, 


with a hole in it, 
this experiment, word 
» or heuristics, produced much larger 
s training, as well as word 
plot title problem also. 


Proving a product, although they 


» are not as effective as one might expect, 
according to a recent study by Warren and Dav. 


JERE l is (1969). Given the assign- 
ment of writing ideas for improving a doorknob, college students with a 
long checklist of 73 hints actually produced fewer ideas than a control 
group, but those with a short list 


produced more. The most productive 
technique in this experiment, however, was one called morphological sy” 


thesis (Allen, 1962), which requires analysis of the dimensions of the problem 
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followed by a new synthesis. Ideas for improving one feature of the 
product are listed along one axis of a two-dimensional diagram and ideas for 
another feature are listed on another axis so that novel combinations appear 
at the intersections. It was this technique, along with the short checklist of 
hints, that stimulated the largest output of ideas. These two techniques also 
generated more ideas that were superior in respect to the experimenters’ 
ratings of originality and practicality—because of the general quantity 
effect. Mean ratings ‘of originality and practicality were not improved, how- 
ever. Klein, Frederiksen, and Evans (1969) also found an increase in the 
quantity but not the quality of hypotheses written by college students to 
explain research data as a result of having a list of acceptable hypotheses. 


CONFLICTING VALUES IN THE SCHOOL ENVIRONMENT 

The conflict in universities and research organizations between pressure for 
Creative achievement and pressure for more immediate practical achievement 
has often been described. The demands on the adult come partly from his 
own motivation, since he does have options for adjusting his own life 
style to the demands of different environments. Children, on the other hand, 
are much more vulnerable to the demands of the school environment. Pres- 
Sures from parents, teachers and peers, some explicit and some implicit, can 
Operate to strengthen or weaken the effects of deliberately planned training 
Programs, "Pies is considerable agreement about the value of reading, writ- 
ing, and arithmetic, but less about encouraging children to turn out fanciful 
ideas of their own before they have mastered the sound ideas of their elders. 
_ After unsuccessful attempts at improving children’s creative performance 
în the classroom, Torrance (1965) turned to an examination of the teachers 
attitudes since some seemed to enjov the opportunity to encourage creative 
activities while others seemed to be threatened by wild ideas and irritated 
Y irrelevant questions. From his observation of teachers’ responses to 
children’s imaginative activities and questions, Torrance then constructed an 
attitude inventory for creative motivation—reflecting an inquiring, searching, 
5 de—as contrasted with critical motiva- 
and inhibiting attitude. Tests 
y-school classes and again, 


"eaching-out, and courageous attitu 
"on—reflecting a controlling, censoring, 
°F creative thinking were given in 20 elementar sla 
With alternate form five months later, so that the gains in different classes 
could be related to he attitudes of the teachers in these classes. For the 
Pupils, the training program stressed quantity rather than quality, avoidance 
9 criticism, combination of ideas, reading constructively, word games, and 
c allenging assignments. For the teachers, it stressed respect for children’s 


i < > If-initiated learn- 
questions and imaginative ideas, and encouragement of se 


Ing wi ; os before and after trainin 
8 without evaluation. A comparison of scores 4 8 


“owed that more children in the primary and intermediate grades improved 
an in the kindergarten, as one might expects but it was in the kindergarten 


that the favorable influence of teachers with creative attitudes was most 
a 


evi; ries written by children 
dent, In a similar way, Torrance compared sto y 
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over a four-month period and found more improvement Gin terms of eee 
of sensitivity, originality, and richness) in stories from children taught by 
teachers with favorable attitudes toward creativity. £ 
Variations in instructions (illustrated earlier) were applied by Torrance 
to a regular classroom assignment in four sixth-grade classes and amplified 2 
a monetary reward. The pupils were asked to write stories around one o 
several suggested titles and were offered a prize for the best story. In two 
classes, they were told that the stories would be judged “on the basis of 
how interesting, exciting, unusual, and original they are”; in the other two 
classes, the stories were to be judged “on correctness of spelling, punctua- 
tion, grammar, and sentence structure [p. 134].” Substantial differences were 
found in the expected directions. The emphasis on originality allowed more 
than twice as many errors but led to higher ratings for originality and interest. 
Depending on the environmental reward, children of this age can trade off 


grammar for originality just as college students can trade off practicality for 
originality. 

EVALUATION OF CLASSROOM TRAINING PROGRAMS 

Very encouraging results have followed a broad but carefully conceived 
and executed program described by Crutchfield (1966) for fifth and sixth 
grades. He used a loose style of programming, in large chunks rather than 
small steps, that encourages the children to think about the complex 
materials presented and directs the reinforcement, not toward right answers 
to simple questions, but toward the production of original and relevant ideas. 
Each lesson consists of a problem in story form, illustrated with cartoons, in 
a 40-page booklet requiring about 30 minutes to do, The readers can 
identify with boy and girl characters solving a series of 
and other mysterious occurrences, but the È 
to instruct the reader in he 


the formulation of the probl 


detective problems 
lessons are written and sequenced 
lpful procedures for creative problem solving: 

em, the asking of relevant questions, the laying 
out of a plan of attack, the generation of many ideas, the search for uncom- 


mon ideas, the transformation of the problem in new ways, the evaluation 
of hypotheses, the Sensitivity to odd : 


? and discrepant facts, and the open- 
ness to metaphorical and analogical hints leading to solutions, The program 
nt processes but these were integrated 
problem. Sixteen such lessons, one a 
or four weeks. 


was planned for practice on compone 


into a coherent attack on a concrete 


day in regular classes, required three 


Two evaluation studies, conducted in regular classes, followed the train- 
ing with new problems of the complex o 
solutions. As compared with matched con 
same classes, trained groups of 267 children asked more questions, and 
generated more ideas, which got hi 


fact, the superiority of the trained chi 


on those problems most unlike the 
the transfer to such problems was substanti 
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performance with intelligence appeared; but when the children were divided 
into 3 IQ levels, the superiority of the trained children over the untrained 
was about the same at all levels. And a follow-up test 5 months later, when 
the children were in different classes, disclosed a reduced but still definite 
superiority for the trained children. 

Another evaluation of these procedures and materials (Olton and Crutch- 
field, 1969) went beyond previous studies by extending the training period 
from 4 to 8 weeks, by bringing the teacher into an active role in stimulating 
and guiding class discussion of the materials, and by adding supplementar 
exercises on diversified topics in social studies, science, human relations, 
and current affairs. Although 280 students in fifth and sixth grades were 
involved, the clearest comparison was between 25 fifth-grade students in an 
Instruction group and 25 matched students in a control group. At the end 
of the 8 weeks, the instruction group was distinctly superior to the control 
group on a composite score representing the various productive thinking 
skills. And on a follow-up test of different problems of the same type 6 
months later, the instruction group was still distinctly superior. 

Others have not obtained such encouraging results. Treffinger and Ripple 
C1969) used the same training materials in grades 4, 5, 6, and 7, but their 
Measures of achievement after training were the Torrance tests of creative 
thinking and tests of arithmetic problem solving. By these measures, there 
was no evidence for superiority of the instruction group. They used only 
the programmed materials and concluded that the lack of improvement, 
even on tests of divergent thinking, indicates the importance of the class- 
room environment and of active involvement by the teacher. Transfer to 
Convergent thinking, as in arithmetic problem solving, is perhaps too much 
to expect. x 
x A training program designed by Davis and Houtman (1968) for grades 

» 7, and 8 incorporates checklists of attributes of objects in order to en- 
and a simplified kind of morphological 


courage i ? 
Ourage feature analysis and synthesis, 
about a boy, a girl, a backyard 


Synthesis in the form of an illustrated story 
Sctentist, and a pet bear. A preliminary field test (Davis, Houtman, Warren, 
and Roweton, 1969) compared 21 students who volunteered for the creative 
oes with 32 students who volunteered for a creotiye wining 
. t 10 weeks of training 3 tests were administered, asking the 
Students to change or improve a hot dog, a doorknob, and a coat hanger. 
~n each problem, the students with creativity training produced many more 
ideas and more that were rated as original. These results could be due to 
i Seneral quantity effect, but, in addition, the mean ratings of all ideas for 
originality were higher in the experimental group. As in the Manske and 
avis (1968) experiment, ratings of practicality were inversely correlated 
with ratings of originality; mean ratings of practicality of ideas were slightly 
ower for the experimental group on each of the three problems. Neverthe- 
ess, because of the quantity effect, more ideas that were rated as practical 
Were produced by the experimental group. An extreme example of the 
Quantity effect is the finding that, even though originality and practicality 
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were inversely correlated, the experimental group produced more ideas 
rated both original and practical on two of the three problems than the 
group without creativity training. 

The contribution of the teacher showed up again when an evaluation of 
performance gains on creativity tests compared a printed program with 
regular instructors in 6 high schools (Reese and Parnes, 1970). The 
program consisted of practice on 28 booklets, two 40-minute sessions a week, 
for 13 weeks; the teachers used the same materials but also interacted with 
the students. The students who had the program improved more (from pre- 
test to posttest performance) than students in control groups, but the ones 
who improved most were those who had the additional benefits of the 
teachers’ instruction. 

All in all, these programs for training creative thinking seem to be most 
effective under two conditions: (1) when the posttests be similar to the 
training materials, and (2) when the printed materi 
et ag E ae men i Syal the question of transfer has arisen, but 
hei expect the enefits of the training to transfer? Certainly, 
the improvement will transfer most completely to new problems similar in 
content and format to the training materials. A t 
Atemps to improve cave iaa ing engineering problems, 
propriate training materials—th E n other areas will have to use ap- 

5 e ough the format and materials of the 
present programs may suggest possibilities. 

These programs are multi 
kinds, problem-solving strat 
favorable attitudes toward new ideals. For de 


desirable also if i 
if the various 
components o ini s io 
separately so that the reason P f the training were studied 


TEN Sh sa be specified more distinctly. 
to be due to different components or a oe for example, oppe 

combinations of components at dif- 
of the printed program and of Fick and Bahlke, 1970). The effects 
have not yet been clearly ‘ronment generated by the teacher 


differentiated. N ` 

: ated. INeverthele itude 

of the gains reported under the best ii eless, from the magnitu 
conditions we can infer that these skills 


had not been practiced i i 
previously, in scho 
Jar 1 ol 7 P sell- 
planned practice is worthwhile, The recent dev E at home; and that well 
evelopment of tests of divergent 


production of behavioral ideas CHendricks, Gui 
i S » Guilford, and 1969) 
may shift the emphasis away from the enn mayy ph oop ie 
= aes = N 


lution of Physical problems towar at king about social a 
h b t d creative thinki 

: c and ¢ 
tal problems he fifth grade is not too Carly to start 


als are supported by 


eacher of writing would not 


ple progra Sach; ag ; : 
p'e programs, teaching cognitive skills of various 


egies, j ; à ‘ 
gies, information about creative thinking, and 


eper understanding, it would be 


JUDGMENT TRAINING 


Judgment of the solutions, the conclusive Phase of thought, should ideally 
be a cool deliberative process, but there seems F ought, should ideal! 
consistency between training for this ‘an igh tag to i an in- 

j e ideal of enthusiastic 
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imaginative thought. The inconsistency vanishes, of course, if production 
and judgment are different processes, separated in time, hence the super- 
ordinate assumption of thought control is implicit in such training programs. 
We do not expect that training directed toward a judicious choice among 
tentative solutions will interfere with training directed toward copious 
production of solutions because we assume that the thinker can turn these 
complementary processes on and off adaptively under appropriate conditions. 
It is not likely that any kind of training will make the thinker produce 
more deliberately and choose more enthusiastically. 

This separation of production and judgment in time does not mean, how- 
ever, that improvement in one process cannot affect the other. Some improve- 
ment in the judgment of plot titles as a result of producing plot titles has 
been obtained (Johnson and Zerbolio, 1964), probably because in their 
effort to produce clever titles the subjects clarified their criteria of cleverness, 
which benefited their subsequent judgments of the titles. 

Several types of judgment practice were arranged by Johnson, Parrott and 
Stratton (1968) for training in judging solutions to the five problems 


described above. The training included study of a rating guide which the ex- 
est one of several triads 


Pert judges had developed, practice in selecting the b 
nations for the official 


of solutions—with knowledge of results and expla 
selections—and practice in contrasting superior and inferior solutions. Some 
subjects had the practice individually, others worked cooperatively in pairs, 
and others again had the benefit of tutorial practice with an experimenter; 
but the sequence of activities was the same for all: production of many solu- 
tions, practice in judging solutions written by others, selection of the 
best of those produced by the subject himself. 
3 There are two questions about the value of such judgment training: Will 
it increase the difference between the solutions selected as best and the 
remainder, and will the ones selected as best be better than single solutions 
Written under standard conditions? For the first question, the judgment 
fe ee was eee Among ee ee ae ene 
eee ifference between the quality of any one sub} p 
tion and the mean quality of his nonpreferred solutions was larger for 
Sroups with judgment training than for comparable groups without it. For 
the second question also the judgment training was successful, but there 
Were exceptions. The quality of the solutions selected as best is the joint con- 
Sequence of quality of solutions produced and accuracy of judgment. When 
instructions to produce many solutions debased the quality of the solutions 
severely, even the ones selected as best were no better than the single solu- 
tions produced under standard conditions. Otherwise, however, this brief 
Pron di ee appears to be a useful technique for raising the quality 
ingle solution. 
What the subjects learn during this practice appears to be the criterion 
of Judgment: the conceptualized dimension of quality, what features of the 
Solutions are relevant to this dimension, and what features are irrelevant. In 


328 Y PRODUCTIVE THINKING 


certain circumstances, it may be more practical to obtain improvement in the 
final solution by training in judgment than by training in production. —_ 

If judgment training is really helpful, it should transfer to Recent 
problem, and such transfer was obtained in a study of the pia teen 
problem (Stratton, Parrott, and Johnson, 1970). One effect of the jud ginen 
training was an improvement in mean quality of the sentences written 
around a second set of four words, as compared with control groups without 
judgment training. Relatively few low-quality sentences were produced on 
the second occasion, so the comparison of preferred and nonpreferred sen- 
tences did not give impressive evidence of the effects of the judgment train- 
ing. But a group of subjects who had a multiple-choice test of judgment a 
sentences, without producing any, registered substantial gains on a second 
sentence problem, as compared with control groups. Probably the best sign 
of the effects of the judgment training was that the quality of the preferred 
solutions on the second problem was significantly higher for those who had 
the benefit of judgment training. 

Since judgment training helps subjects to write y E 
select the best of these, the obvious next step was to give the benefit ol 
judgment training to those subjects who write only one solution. When this 
was done, the performance on 
that of control groups writing one and to that of control groups writing many. 
To sum up, the optimal condition among the many that have been im 
vestigated (Johnson, 1968), was to ask the subject to write one solution 
to a problem, with the benefit of information about the criteria for good solu- 
tions and with training in the judgment of solutions. . 

Production and judgment training have both been effective, in different 
ways, so Stratton and Brown (1972) designed a comprehensive experiment 
on writing plot titles, as many as possible in seven minutes, that included 
production training, adapted from Allen’s (1962) morphological synthesis, 
judgment training as described above, and a combination of the two. The 
production training resulted in a large increase in the number of solutions 
written but no increase in the mean 


ment training resulted in a significant increase in mean quality of the solu- 
tions produced and, of course, 


high-quality solutions and 


a second problem was substantially superior to 


and those with judgment training by 


nation of the two types of training was 
not a simple addition, but those who had both types combined produced 


solutions of higher mean quality than those with only production training 
and a larger number of solutions than those with only judgment training. We 
may hope that combined training increases flexibility of thought control, 
leading to high productivity under appropriate conditions and critical judg- 
ment under other appropriate conditions, but this is only a hope at present. 
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Production Controls 


Having identified the processes by which ideas are produced, we should 
next describe the controls on these processes: what turns them on and off, 
what directs and limits them, and how they are affected by training and 
individual differences. The description is necessarily speculative, but the 
research outlined above supplies an empirical foundation. We can discuss 


C 9 P : . 
ontrols on production with reference to the three processes—preparation, 
d which much of the research has revolved. 


D nE? , 
production, and judgment—aroun 
if other names were given to the 


The discussion would not differ greatly 
Processes which account for the empirical results. 

The preparation process is turned on and the preceding routine activity 
ep off by an environmental obstacle which impedes routine activity, 
*y cognition of an unexplained event, or by questions or instructions from 
another person. There are large individual differences in willingness to stop 
ae which go by such names as interest in abstract thought and sensi- 

y to problems. If the individual accepts the problem, then the preparation 
Process continues until he feels that he is ready to produce a solution. This 
readiness for production, which involves a personal parameter as well as the 
objective conditions and the instructions, controls both the duration of the 


Preparati + 5 
Preparation period and its adequacy. 


i The production process, once started, 
eVi . . 
eral other factors identified by the 


is controlled by the preparation and 
above research and discussed below. 


i duration of production is controlled by the criterion of achievement, 
e i S f 4 . 

clings of boredom, and the like. If more than one solution has been pro- 
submitted, conclusion of production triggers a 


duced - k 
cuced and only one is to be a 
l one solution has been 


JUdome š à ` 
‘| Igment process, which typically continues unti 
chosen, } J 


Thus it appears that one process continues until it reaches some criterion or 


pashold, then the next begins. Overlap is also possible: produira T aa 
soluti, the preparation is finished, as when one reads to get i an or : 
ene Presumably, the criterion of readiness for the seon apera ma 
ope ae hed asymptotically, hence it is reasonable to expect that the seol 
ae may begin before the first ceases completely. Judgment often begins 
hes pa solutions have been produced, and the as : Senap 
all sol eads to a retroflex shift in the criterion that controls pa i 
rie ne are judged inadequate and motivation persists, t e judgment 
S initiates another try at production, or even another preparation. 


E 

LASSES AND MODELS 

mehi research and discussion suggest that the production of words and 

T 3 ; Vi 

fi items can be viewed as sampling from a class of items, as in writing 
the names of cities—given the usual instructions, the subject names 


Othe 
Ow: 
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cities rather than automobiles or prime numbers, and the output is controlled 
by his knowledge of the class and of instances of the class. This is a point 
at which the psychology of thought joins the psychology of memory, and an 
explanation of the retrieval of previously learned items to fit specific instruc- 
tions will advance an explanation of productive thinking also. The class may 
be diminished in size and the control increased by additional restrictions, 
such as a request to produce cities that have symphony orchestras or uses 
for a coat hanger that are both original and practical. When several such 
restrictions are added, the assignment approaches memory for a single item. 

It would be a mistake, however, to assume that sampling from a class is 
typical of productive thought. Suppose the assignment is to name cities of 
medium size. The adult subject has a dimensional concept (size of cities) 


but it is not likely that the interval called medium on this dimension 


has definite boundaries. It is more likely that the subject has a concep- 
tualized point, albiet rather indefinite, on the size dimension and that 
production is biased by Proximity to this point. Thus, if someone wrote down 
the names of 50 cities of medium size, a distribution of these plotted on the 
actual size dimension would be unimodal rather than U-shape, and the sub- 
ject’s concept of medium would be better represented by computation of the 
mean and standard deviation than by computation of the boundaries between 
medium and small on the one side and between medium and large on the 
other. The lesson to be learned from this example is that production may be 
controlled by a conceptualized ideal or model as well as by a conceptualized 
class. It may be convenient to talk about multidimensional classes and the 
intersection of sets of solutions, but more often production is controlled by a 


conceptualized model, defined by the intersection of dimensions or attributes. 
Production of solutions to the Sentence: 


first of all, by the class. That is, the pro. i 
a single word or an unstructured list 6 
is biased toward the high end of the ¢ 
of the smoothness dimension, The obtr 
to be low. No doubt there are other d 
menter or subject, which also influe 
some subjects produce mostly short 

None of the records contained obscene words, I 
of solutions is guided by the conceptualized m 
what he puts in, what he leaves out, w 
which he considers secondary, a 


leverness dimension and the high end 
usiveness of the given words is supposed 
imensions, not made explicit by experi- 
nce production. The records show that 
Sentences, others mostly long sentences. 
Hence the subject’s production 
odel which he has formulated: 
hich dimensions he considers primary, 
nd so on. 

RESPONSE HIERARCHIES FROM 
AND ANTICIPATIONS FOR THE 


A naive reader of research on 
impression that the researchers 


THE PAST 
FUTURE 
productive thinking could easily get the 


are opposed to thought; they want their sub- 
jects to respond immediately. But serious thinkers often : 


for a while, analyzing the relevant features a 
solution might be. Observation of a child 


contemplate a problem 
nd anticipating what a good 
playing with blocks will often 
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se Aa inference that he has a preview of his end product—he wants it 
to be big or to x like a ck—ev ofore a > 

TEES for ae a Pa rA AER ey aiga re 
y g his goal. a purpose. Just as a sentence 
is produced in order to communicate, a solution is produced in order to fit 
some specifications, to perform some function. The control of production by 
a response hierarchy is control by the past; the control by a conceptualized 
model constructed to meet certain specifications for a solution is control by 
anticipation of future achievement. 

Analysis of the requirements for a solution often begins by listening and 
reading, which are preparatory information-seeking processes, and if often 
involves a feature analysis. Feature analysis in the perception of a complex 
pattern, as described by Neisser (1967), probably has something in common 
with feature analysis in preparation for production of solutions, although the 
former proceeds much faster. Listening to the instructions for a word associa- 
tion test, for example, the subject figures out what the experimenter expects, 
what role he himself is supposed to play, what attributes of words are im- 
portant, and so on. Studying the chart for the Conclusions problem, the 
thinker sees a time dimension, a program dimension, and probably more 
abstract features. He ignores many other features of the objective situation. 
Records of the production of plot titles illustrate how many different features 
of a plot can be picked out as the basis for construction of a title. The 
thinker then tries to construct a solution to match the requirements as he 
understands them. Hence his production is controlled by his preview of the 
solution to be produced at the same time that it is controlled by the strength of 
the responses that have been produced. 

When the subject knows little about the solutions required, as in writing 
unusual uses and word associations under standard conditions, the influence 
of the response hierarchy is strong. But when he is told, or infers, that the 
Solutions should have certain properties, the relative influence of the response 
hierarchy decreases and the influence of the model increases. This concept- 
ualized model should not be treated merely as a search model, controlling 
exploration of the physical environment and search through the recesses of 
Memory to find the needed item; it should rather be treated as a construction 
Phir controlling the assembly of old parts and the invention of new patterns 
ae Poi pattern of requirements anticipated. As ee ae Saat R 
Silene ae of course, reanalyze the problem, my saree cba 
Spas imensions of the solution, and may reorganize & J 

Juite different model, but this is likely to be a multidimensional model com 


Pose f “ 
Posed of several dimensions and classes. 


THE PRODUCTION CONCEPT 
olution can be called a production 


r The conceptualized model of the desired s 
Concept since it has many of the properties of other concepts and can be 
Investi i i S 

vestigated by the methods of concept research. Consider first the task of 
aring a tree. In the field, the artist’s activity is guided by perception of a 
Specific tree. In the studio, his activity is guided by a memory image of a 
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specific tree or perhaps by a general concept of a tree. Or he modihes E 
general concept to include more abstract, imaginary dimensions: a stun 
tree representing despair, a flourishing tree representing hope. Next, conside 
the construction of a sentence to communicate a complex thought. It is 
reasonable to assume that the speaker's vocal output is guided by a prior con- 
ception of what he wishes to communicate. Such examples suggest that is 
production concept operates positively, as a target or goal toward which 
production is directed. The thinker's satisfaction with his achievement is 
evaluated subjectively in terms of the agreement of the output with the 
conceptualized goal. Discrepancies from that conceptualized goal are rejected, 
as when the artist erases a line that does not give the effect he wants and when 
the speaker interrupts himself to revise his sentence. Thus the control in- 
cludes a feedback component, a recurrent ev: 
goal. 

In the solution of a more abstract problem 
in common with a class concept, such 
A production concept is a general co 
specific solution, just as a class cone 
instance of the class. Like a class conc 
achievement, based on previous exper 
communication. Unlike a class conce 
associated with a standard label, 
idiosyncratic, unlabeled achiev 
solution of the present proble 
of objects and communicatior 
guides production of solutions 
solutions. 


The production concepts that different individu 
tion of a complex problem vary 
features or classes, w 


aluation of progress toward the 


, a production concept has much 
as the concept of sonnet or aggression. 
nceptualization of the solution, not a 
ept is a generalization, not a specific 
ept, a production concept is a cognitive 
ience, present circumstances, and social 
pt, which is standard in the culture and 
a production concept is usually a transient, 
rement of the individual, instrumental only to 
m. Thus the class concept guides classification 
n about objects, while the production concept 
and, to a limited degree, communication about 


als construct for the solu- 
in many attributes, some of which are discrete 
hile others are continuous dimensions, More important, 
some attributes are explicitly included in the task assionment as requirements 
of the desired solution while others are not. If the preblein has many good 
solutions, those attributes included in the instructions are easily incorporated 
into the production concept and normally appear in all solutions. Students 
asked to write plot titles write plot titles and those asked to draw trees draw 
trees. The variations in ariations in ability, to be sure, and 
A included in the ‘conceptualization 
of the desired solution. Thus it happens that most solutions are acceptable in 
that they include the explicitly 


stated criteria, but they vary on other dimen- 
sions that can be treated as additional Criteria of evaluation, It is these 


additional unspecified, or imprecisely specified, dimensions—such as practical- 
ity, style, and scope—that make the production assignment somewhat am- 
2 fo) 


biguous and permit production concepts to vary widely between individuals 
and conditions. z 


The idea of a production conce 


pt is not altogether new. The “anticipatory 
schema” was discussed long ago i 


n the context of experimental introspection 
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(see Chapter 1), and more recently the “plan” was discussed in the context 
of computer simulation (Miller, Galanter, and Pribram, 1960). The research 
reviewed in this chapter ties the old idea more directly to empirical data on 
productive thinking. The hypothesis of the control of thought by a production 
concept cannot be validated by production records alone, however, because 
there are alternative explanations for the records. But if the thinker has con- 
structed a concept that controls his production of solutions, it will also 
influence his judgment of solutions—both his own and those produced by 
others—and his verbal description of his production concept. Let us say that 
past experience and working conditions lead an individual to construct a 
Production concept for a given problem with practicality as an important 
dimension. He will therefore produce practical solutions, within the limits of 
his ability; and, if asked to judge solutions produced by others, he will give 
high ratings to practical solutions. If he is asked to describe his production 
concept by ranking several relevant dimensions of the solutions, he will rank 
practicality high. Other conditions that lead to production concepts emphasiz- 
ing other dimensions (such as originality or comprehensiveness) will have 
Corresponding effects on production, judgment, and verbal description. 


INTERNALIZATION OF CONTROLS 

It is often assumed that, when asked to write a solution to a problem of 
the type considered here, the subject thinks of several solutions before 
Writing down his best one. If so, the process can perhaps be laid open for 
inspection by asking the subject to write these several solutions as he pro- 
duces them. But we cannot assume without evidence that production of many 
solutions is a slowed-down version of production of one solution. There is 
a possible fallacy here, comparable to the fallacy of assuming that percep- 
Hon of a figure. under repeated tachistoscopic exposures is a slowed-down 
version of perception under normal exposure. It is especially hazardous to 
an effective condition, the single-solution con- 


assume ; 
sume that what happens in 
ppens in a less effective condition, 


dition, is a condensed version of what ha 
n multiple-solution condition. 

k Evidently, multiple-solution production has some value as training for 
single-solution production; the two procedures must have something in com- 
in this chapter demonstrates that the 
more restrictive than the con- 
suggests how these controls are 


aT the research summarized m 
trols or on single-solution production are 
: n multiple-solution production and i 
€veloped. When someone is asked to write many solutions, we have seen 
Nat he usually takes this as per andards and write solu- 
Hons of lower quality. When instructions about the criteria of good solutions 
Pii the number of solutions is saer aere man E a 
mS ean that production continues as before oe hat solutions whic do 
R Nie the criteria are then easy Mages iege a with 
inier multiple-solution groups, multiple-so es 5 a wae entera 
of higher mean qua ity but the standard devia- 


effect of the criteria information is to improve 


mission to relax his st 


e. ation write solutions 
$ are 
are not smaller. The 
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i i ikely that the cri- 
quality evenly along the quality scale. Hence it seems likely that t n 
teria information is used by the subject to formulate a more adequate ji 
i it i ied afte cti s occurred. The 
for production, not that it is applied after production has occ rre H 
checklists, hints about strategies, and other materials of the training pr 
grams, along with the rewards from a supportive environment, may in effect 
f= ? ò r. P “i : xe 
supply criteria information. They may aid judgment as well as production, 
but, if the program is successful, the subjects probably build these features 
into the controls on the production process. 
The same argument is relevant to simpler responses, as in discrimination 
i i > i me. tec ~ ako x ses 
experiments of the signal-detection type. The subject could make respons 
and later, by a separate process, judge each response right J 
fact, he is more likely to control his activity in advance, establishing a strict 
criterion of accuracy and achieving a low error rate, or a weak criterion, de- 
pending on the instructions about the payoff or his previous experience with 
it. 
Practice in judgment of solutions has 


or wrong. In 


a similar effect on the control of 
production. It appears that the criteria of judgment, once learned, are 
internalized and so control the production of solutions 
after they are produced. 


The beneficial effects of criteria 
are not properly described merely as 
the number of solutions is reduced, | 
tive one: the construction of 


as well as the judgment 


information and practice in judgment 
a tightening of controls. It is truc that 
but the more important effect is a posi- 
a more specific model for production. 


Major Trends and Implications 


The intellectual glow that radi 


lighting up goals for ambitious y 
a field of study, 


ates from the great thinkers of yesterday 
oung men and women today also outlines 


a division of the psychology of problem solving. The cre- 
ative achievements of the geniuses of history—; 


: $ and those of eminent living 
artists and scientists as well—have been characterized, in varying amounts, 
on such dimensions as intellectu sensitivity to problems, origi- 
e criteria by which these have been 
; dgments, and work samples. Distinc- 
jointly of favorable circumstances and a cre 


al leadership, 
nality, and unusualness of output. Th 
evaluated are product records, peer ju 
tion is rare, the consequence 
ative individual. 

Creative thinkers, identified by 


these criteria and contrasted with com- 
parable men and women, are 


distinguished on the 
gence, good education in their own field, a strong 


their own goals, and an equally strong drive 
Scientists, particularly, prefer intellectually 
to avoid distracting social situations. There a 
creative thinkers in gencral seem to be 


average by high intelli- 
preference for choosing 
toward achieving these goals. 
challenging situations and tend 
te differences between fields, but 
independent individuals, capable 
from their families and teachers 
ers and supervisors. They start early 


of resisting pressures toward conformity 
when they are young and, later, from pe 
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and do their best work when relatively young. Since creative individuals are 
usually studied after they have become eminent and adjusted to the public 
role of deep thinker, some of the distinctive traits may be results of this 
role. But others, such as high intelligence, an interest in abstract thought 
rather than social interaction, and independence, are manifested early in 
life. 

Creativity test scores have been obtained from tasks as simple as the 
association of one word with another, objectively scored for unusualness, 
and as complex as the writing of plot titles, rated for cleverness by trained 
judges. Scores on different tests have enough in common to identify an 
originality factor by the techniques of factor analysis, but it must be a weak, 
poorly defined factor because in many studies the correlations between tests 
of creativity are as low as the correlations of these with intelligence. When 
a battery of creativity tests is administered and the total scores correlated 


lations are in the low and medium ranges 


with intelligence tests, the corre 
for the most part, although influenced by the nature of the sample and the 
asures of educational achieve- 


circumstances of testing. Scores on standard me an 
ment can be predicted better by intelligence tests than by creativity tests. 
But if creativity tests are purified and if the current emphasis on creativity 
actually has an effect on measures of school achievement, correlations with 
intelligence will decrease and correlations with creativity scores increase. 
Attempts to describe the personality traits and adjustment of high scorers 


On Creativity tests, in spite of many inconsistencies, have yielded some agree- 
í thinkers of history. Laboratory 


ment with the distinctive traits of the creative a 
tests of more specific intellectual traits, constructed according to a hypotl e- 
sis about associations, suggest that the high scorers deploy their aE 
broadly and thus receive a broad range of information which they can later 
use adaptively. ’ 1 
Predictions of creative achievement, like other predictions, are ge t, 
and the criteria of creative achievement, like other criteria, complex. Never- 
25s in identifvi iv ientists 
theless, some success has been reported in identifying productive a 
in large laboratories, by a variety of predictors, especially the biograp hica 
data bank. The usefulness of intelligence and education as ba > 
imited by the conditions of employment; but work habits, fami y a 
and activities, and personal orientations that take shape early in F an 
: : : : F use- 

can be investigated through the collection of biographical data, are all use 


ful, 


r While many organizations dependen 
i ti ee 
On are attempting to progress by identif a e 
others are attempting to better the working conditions of the personnel they 


ave. The two policies are not independent; surveys of scientists’ likes and 
dislikes indicate that the optimal conditions for creative work are the con- 
ditions that attract and hold creative thinkers: financial support, laboratory 
acilities and technical assistance, communication with colleagues, and free 
Choice of research problems, methods, and deadlines. iv 

he problems solved in laboratory research on conditions and processes 


t on research and scholarly produc- 
ying and attracting creative thinkers, 


336 *%¥! PRODUCTIVE THINKING 


i Pi > criticized as trivial, but the intent is to dis 
of creative thought can be criticized 


over 
: . E 

eneral principles that will apply also to problems of more substance. 

g a 2 \ í l 

separation of the processes is worthwhile, perhaps ne 


ary, because the 
principles may be specifie to processes. Three candidates, with PR nn 
for their independence, are preparation, production, and judgment. TI a a: 
tion most often consists of collecting information by reading, integrating 
the relevant material, and getting set for the next process. The cial abi 
thus collected is more useful for productive thinking if the reading is done 
with a constructive rather than a critical emphasis. 

As to the old question of the relation between quality and quantity, it 
is clear that subjects instructed to write one solution write solutions of hight 
mean quality than those instructed to write many, but the latter write more 
superior solutions because of a general quantity effect. Among those who write 
many solutions, instructions not to evaluate seem to be helpful but even 
more helpful are instructions that are specific about the criteria of evalua- 
tion. Instructions shift the emphasis from quality to quantity, or the reverse, 


but information about criteria of good solutions makes a more positive con- 


tribution to the production process—both for those who write many solu 
tions and for those who write 

between problems also. | 
divergent continuum h 


7 . . . tri `S 
one. The quality-quantity relation varic 
Problems at the convergent end of the convergent- 
ave onh one good solution, so attempts to produce 


many solutions result in many low-quality solutions: problems at the diver 
gent end of this continuum have m 


many solutions result in only 
studies of individual difference 
between quantity produced 
responding correlations for 
relations betwee 
Records of pr 
hypotheses 


any good solutions, so attempts to produce 
ight drop in quality. For similar reasons, 
s with simple tas 


as 


s show positive correlations 
and number of superior solutions, but the cor 
more substantial problems are near zero. Cor- 
n quantity and mean quality 


are gencrally negative. 
oduction sequences 


afford an Opportunity to test the current 
SSeS, 


about production proce: 
for all the data. On simple 


lowed by the uncommon responses, which are considered of higher quality. 
But there are differences due both to conditions and to problems. Typically, 
instructions to write many solutions are taken as pe 
after which improvement can casily occur. When 
tion and include the criteria for good solutions 
When the problems involve 
duced in the first half, 
Producing solutions without 
drinks without tasting them. 


from several produced by them, the one selected i the average 
2 S better than : 

of the others not selected, but this difference is not large. The solutions 

selected under such conditions are no better, on the average, than com- 

parable solutions produced by subjects who write only one. Even in the case 


of minor laboratory problems, analysis of the thinker’s judging operations 
along with his producing operations generate: 


but no single hypothesis accounts 


tasks the common responses occur early, fol- 


Tmission to lower quality, 
instructions specify evalua- 
» no such improvement occurs. 
convergent thinking, better solutions are pe 


judging them is 


When college 


. . ining 
incomplete—like mixing 
students select one solution 


iption 
S a more complete descriptior 
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of intellectual activity and suggests possibilities for improvement. But the 
judging operations of reviewers, editors, and others who evaluate someone 
else's products also deserve psychological analysis. It is possible to determine 
the relative weights which reviewers attach to the various dimensions of 
complex solutions and perhaps to train the producers accordingly. These 
possibilities illustrate again, in more detail than usual, the dependence of 
the productive thinker on the social environment in which he attempts to 
make his mark. 

Although serious attempts to train productive thinking in industry and in 
school have been carried on for only a decade or so, encouraging progress 
has been reported. The brainstorming techniques taught in courses in crea- 
tive problem solving have resulted in some improvement on tests of creative 
thinking. And so has associative practice in giving uncommon responses. 
The favorable outcomes do not unequivocally support the hypotheses that 
generated the training programs, however. Improvement in performance on 
these simple tasks may not indicate any increase in intellectual skill but may 
$ as from an emphasis on quality to an 
some conditions can raise scores. The 
be considered in the interpretation 
alert subject can easily figure out the 


result from a temporary shift of set— 
emphasis on quantity—which under 
Social side of the training must also 
because in most training situations the 
Criteria of a good solution so as to give t 
and can even learn to play the role of original thinker. 

f ised more substantial problems, 
ences, as training materials 


he experimenter what he expects 


; Training in strategies of production has v 
involving the explanation of mysterious occurr 
This training has included a variety of thought 
i ss that has been obtained 
It does offer con- 


and as tests of achievement. 
Processes as well as class discussions, so the succe 
does not clearly specify the nature of the improvement. 
siderable promise for educational practice. The training transfers to other 

Problems of the same type, perhaps not to problems ofa different type. 
These programs generate an increased inte productive thinking and 
encourage a more ‘play ful attitude toward problem solving. Hence, critics 
tude may explain the 


"VE suggested that the favorable interest and atti i 
m sol and industry where conformity to 


] to conformity of thought. This is a 
al basis of the training, but is it a 


rest in 


ainino's e A : 
i Ming's success, especially in sche 
Nstitut; z 
E Stitutional norms can easily spread 
convinci i ay 7 af 
| vincing criticism of the theoretic: ; eae h 
criticism of the practice? Any procedure, in school or in industry, that be- 
Stows institutional respectability on original thinking cannot be all bad. 
The realization that solutions to a problem have to be screened by some- 
s judgment phase of productive thought. 
s 


one s D 
€ leads to attempts to train the fates ver weal i 
easi ; AA several substantial problems and, 
onable gains have been obtained on se al Ff 


a training procedures, these ga 
ame type, but the extent of the 
A he central question of productive 

though production may be conceived 


ins then transfer to other problems of 
transfer has not yet been determined. 
thinking is one of thought control. 
as association of ideas, or as sampling 


Tom a class of objects, more often it is controlled by an ideal or model of 
a . ? 7 i i S 
the required solution, constructed from the relevant dimensions of the prob- 
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lem situation. While the flow of ideas is biased by response hierarchies 
from the past, it is at the same time controlled by anticipation of the fu- 
ture achievement. As knowledge of the specifications of the solution in- 
creases, the influence of response hierarchies is replaced by the influence 
of the conceptualized model. i 
While it may appear that the production of many solutions is an expansion 
of the production of one, the evidence indicates that the controls are differ- 
ent. Instructions to write many solutions reduce standards and lower the 
quality of output, though some improvement may follow. 
instructions, giving the criteria of good solutions and requiring judgment 
of the solutions, raise the quality of output not, negatively, by more re- 
strictions but, positively, by facilitating construction of a more specific 
production model. The controls could be applied to the rejection and selec- 
tion of solutions after they are produced, but the data suggest, on the con- 


trary, that the controls are built into the conceptualized construction model 
and influence production from the outset. 


More specific 


CHAPTER 8 


JUDGMENT: 
STIMULUS 
DIMENSIONS AND 


COMBINATIONS 


Unidimensional Judgments - 
Multidimensional Judgments in 
Multidimensional Judgments on Cont: 
Some Characteristics of Judges 

Major Trends and Implications 


Discrete Categories 
inuous Scales 


| udgment has been defined as the assign- 
ment of an object to a small number of 
specified categories (Bieri, et al., 1966; 
Jee), Johnson, 1955). The object may be as- 
to one of two complementary classes or to one of 
als on a scale, but in either case the terms 
person to describe an object to another, as 
well as the effect on his own behavior, depend on the 
of the judgment. The categories are specified 
and the judgments are communicated by various sets 
of category names: guilty and not guilty; strongly ap- 
prove, approve, neutral, disapprove and strongly disap- 
prove; 1, 2, 3, 4, 5, 6, and 7. This assignment of 
objects to categories is not random, to be sure, nor is 

he discrimination of a difference be- 


judgment simply t 
tween objects. Logically, judgment has the function of 
settling an uncertain state of affairs—in contrast to 


productive thought, which may be quite unsettling—and 
in judgment experiments the final state is more orderly 
than the initial state. The order may be achieved by He 


several interv 
used by one 


outcome 
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signing a hundred objects to seven categorics er by i veces 
choices and actions; but, in general, the state of affairs that impeded a 
activity and instigated the judging process becomes less confusing, less 
ctive to further activity. 

por dimensions to which attention is directed and responses. aie 
related are determined by the situation in which the judgment mend 
When judgment occurs as the conclusive phase of a complex ‘imeem ing 
enterprise in which the thinker halts his productive activity to eva 
his productions, the critical dimensions are consequences of the ae 
(such as motivation and previous experience) that started all this. W . 
judgment involves the comparison of alternative courses of action, the 
critical dimensions may be as varied as financial gain, intellectual enjoy ment 
or bodily comfort. In psychophysical judgment, the critical dimension is 
usually a simple one, implicit in the set of category names, such as long, 
medium, and short. In general, the thinker tries to match the response cate- 
gories, or output, with certain dimensions of the objects of judgment, or 
input. As Stevens (1966a) put it, after extensive experience with psycho- 
physical judgments by several procedures, “the core of the act of gamete 
is a process of matching [p. 385].” Hence a more complete statement would 
be that judgment begins with unordered objects, events, or persons, assighss 
them to specified response categories so as to maximize the correspondence 
between the responses and the critical dimension of the stimulus objects, a" 
thus ends with a more orderly situation. 

In psychological terminology judgment seems to be the general we 
probably because interest in these processes grew out of psychophy nt 
judgment, but other terms are applied in special situations. The er 
rating appears frequently when judgments are made on a scale of numbered 
categories. If the critical dimension is one of value, as when the judgment 
follows productive thinking, the term evaluation mav be used and the cate- 


gories are likely to have value-laden names, such as acceptable and umac 
ceptable. If the judgment involves the 


ive courses of 
outcomes of alternative courses í 
action, it is often called a decision and then the 


as buy and don't buy. The term 
taste or individuality, and the 
and decisions. 


categories have such iced 
preference is used to emphasize persona 
term choice seems to embrace both preferences 


A judgment can be considered the 
in accordance with the discussion of 
first tentative response is not the fin 


solution of a special kind of problem, 
problem solving in Chapter +. The 
al response. Judgment involves some 
variability within and between sub- 
mediated by a conceptualized dimension, 
In distinction 
judgment is conclusive, not productive. New solutions are not produced, but 
alternatives are decided. These matters of judgment, as we call them, are 
not always settled completely, however; Variations in confidence in the 
judgment may be expected. 

It is the dimension mediating the judgments th 


delav, some uncertainty, and some 
jects. And the response may be 


such as cleverness, guilt, or length. to other problems. however. 


at distinguishes this pro- 
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‘Tike oe If the task is to compare two lines and say which 
“Oh B 7 naa . de elen discrimination as well as judgment. But if the 
ii iret ines bE pe n oe each separate response is mediated 
nicety; many eae eee ne ga panning 
them to make judeme The oth ET OEE ae Sues ess 
neni As moh a he ena human capacity is an ability 
reget = oy ig judging seats objects. Piaget has noted 
next chapter will real th: ? orks wr ae (see Chapter 3), and the 
‘dies _ show that accuracy of judgment is not the same as ac- 

a scrimination. 
P a ged about an abstract dimension that is a common attribute 
i oe i k if erent objects point up the dependence of judgment on 
C eeri A judge cannot follow instructions to rate drinks for 
Bee aa A for g eeoa y e - p the meanings of 
s al conce Ana Ww p: S x i 

ees a i iy named 9 HE en Rll the ang fa 
eta ; ; pe i a i : jects that wary in 
sions and instructed only to rank them, he will rank in re- 

But the ability to make consistent 
same as the ability to name 
ass concept, different mea- 
are usually correlated but 


reki most salient dimension. 
Bt A defin ° i particular dimension is not the 
sures of ea £ = dimension. As in the case of a cl 
thev are a of a dimensional concept 
The », t identical. 
often bl al te for judgment may be 
guilty ¢ sii instructions. You are to judge the i 
fea solely on the basis of the evidence presented in th 
ese weights one at a time 4 al í > i 
s at a time and tell me if the second 
“Rate these children in respect to 
organized, the prepara- 
or directs his attention, 
it specifies the al- 
he stimulus 


instituted in several ways, most 
defendant guilty or not 
his court.” “Lift 
is lighter than, 


entire or the same as the first.” 
tion eine ine a gis of 1 to 7.” However it may be 
By ihe ne ies two things. It alerts the thinker, 

ant attributes of the stimulus objects and 


tern 
ctions, along with t 


ative re : = a 
e response categories. Thus the instru 
nts on judgment, 


materiz 
relatively k the independent variables of experime: 
Tetali asy to manipulate. 
in eon + the instructions may be simple, if the objects in view vary 
ikea peri one dimension, maintenance of a set for one dimension rather 
of Po lin is difficult. Postman and Page (1947) demonstrated the effects 
their sub; ie inhibition in psychophysical judgments of this kind by having 
for ae ee rectangles for height, then for width, and then again 
disturbed ep he precision of the second series of height judgments was 
16 junds 3y the interpolated activity, because of the interference of the set 
Ww ge width on the set to judge height. ie 
hen judgment occurs as the terminal operation of productive thinking, 
one Preparation follows from the dynamics that initiated the thinking, and 
we £ supplies the materials to be judged. In this case, 
tions of judgment is more difficult but, as noted 


not impossible. 


ite thinker himsel 
manipulation of the condi 


nt 
he Preceding chapter, 
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Here we are concerned with stimulus variables, or the input side of judg 
i p oe S “e 
ment, moving from unidimensional to multidimensional judgments; t 
, g ‘ j 
next chapter will deal with response variables, or the output side. 


Unidimensional Judgments 


In referring to the stimulus variables, judgments can be treated as y 
and complex, or unidimensional and multidimensional—though this ` 
tinction turns out to be less clear-cut than it sounds. Unidimensiona 
judgments are those in which the objects of judgment vary mainly along 
the one dimension on which they are to be judged. When the objects vary 
in several dimensions, it often becomes an empirical question which dimen- 
sion or combination of dimensions determines the judgments. There are 
three kinds of stimulus dimensions, and different procedures have been 


elaborated for the study of each, but some general principles of judgment 
have emerged. 


THREE KINDS OF STIMULUS DIMENSIONS 


PERCEPTUAL JUDGMENTS - 
The psychology of judgment started with psychophysics. Methods were 10- 
vented to standardize judging operations so that thresholds could be de- 
termined and perceptual functions studied precisely. But certain constant 
errors—such as series effects, errors of central tende 
—turned up regularly. At first, 
to be eliminated by a more care 
constant errors were interesting 
shifted from attempts to elimina 
them, the psychology of judgmen 


lency, and anchor effects 
these were treated as distorting influences; 
ful methodology; but to some observers = 
in their own right. When the emphasis 
te these effects to attempts to understand 
t began to take shape. 

AFFECTIVE JUDGMENTS AND CHOICES 

A century ago Fechner, the father 
empirical science of esthestics by 
on their preferences for i 
affective, 


» ‘ an 
of psychophysics, tried to start 
collecting data from hundreds of observe! 


è 4 å =e (0 
es tor rectangles of different proportions. Studies € 
or hedonic, judgment ask the observer |} 


; vent 
; how an object or ee 
affects him, how pleasant or unpleasant it seems, which of several he wouie 
’ h : ants 0 

choose. Such judgments may be considered extensions and refinements 
primitive approach and avoidance reactions, coded into communicable te 
sponse categories. Most people adopt a set for these personal judgments 
easily because they enjoy directing their attention to the affective dimensions 
of their experiences. We all know that casual conversation is made UP 
r ik islik ; : a. ; some 
largely of exchanges of likes and dislikes, As an illustration, Fig. 8.1 plots so i 
results of an early quantitative investigation by Saidullah (1927) of th 
pleasantness of salt solutions of different concentra 


tions. 
It is noteworthy that the stimulus materi 


: ate 
als, whether salt solutions, " 

i . srci a 
tangles, or sounds, may be the same as for an experiment on percepti 
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FIGURE 8.1, Affective judgments of salt solutions. The ordinate indicates the number 
of pleasant judgments by several observers minus the number of unpleasant judgments, 
being the maximum in either direction. Data from Saidullah (1927). 


judgment. The difference is in the instructions. The observers are not asked 
to judge the size of the rectangles or the concentration of the salt solutions 
ut the pleasantness of each. 

When someone chooses between alternative courses of action, as between 
going to a movie and reading a book, the basis for comparison is not 
so clear. The actions are apparently unrelated and the affect is anticipated 
tather than immediate. Although there is no obvious dimension common to 


the actions, the fact that choices are made suggests the hypothesis that the 


Judge can compare the alternatives on a dimension of enjoyment or value 
to be expected from each. This dimension is called expected value, and 
under some conditions tabulation of choices permits calculation of the 
scale value of each alternative and prediction of subsequent choices. 


ABSTRACT JUDGMENTS 
any educated people base their judgments on highly abstract attributes. 


in editor decides to publish one manuscript and to reject anaien a Pon 

Sives a term paper a grade of B; a psychologist states that a client's de 
fenses are extremely weak; a merchant estimates the probability of increased 
Sales following installation of air conditioning. All common dimensions on 
which diverse objects are judged are abstract, but some, such as defensive- 
ness and probability, are more abstract than others, such as length and 
saltiness, Naturally, individual differences due to education and experience 
are more prominent in the more abstract areas. 

Good evidence on the judging process also comes from research on eco- 
nomic and social attitudes, because attitude statements often have the form 
of affective judgments of abstract symbols and statements. “I am opposed to 
* State income tax.” “A state income tax may be necessary under some 


YSIONS AN NATIONS 
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conditions.” “A state income tax distributes the wealth.” Judges — 
in the construction of an attitude test have the task of assigning eae a oi 
ment of opinion to a position on a dimension Ceg, from Smal ed 
tax), and the consistency of the judgments indicates the in To bron 
abstract dimension. Such dimensional concepts are properties of the cu 
as well as the individual judges. o ae 
Logicians commonly speak of a judgment as a proposition, or pa 
and judgments of the logical validity of propositions are perhaps the n 
abstract of all. But from the psychological point of view, the judging 
process is a small part of formal reasoning in comparison with other pro 
cesses, hence these matters were discussed in a previous chapter. 


CONSISTENCY AND ACCURACY 


SCALE CONSTRUCTION 


It is not surprising that stable discriminations can be made and or 
objects can be consistently scaled on dimensions for which man has peee 
sensory mechanisms. Consistencies that are obtained in scaling ae 
objects, such as statements of opinion, are more relevant here becaus 
they are dependent on an acquired dimension, a property of the on 
If different judges agree in scaling statements for conservatism, apene 
they agree on the definition of conservatism or not, the agreement Todien 
a shared cultural dimension, in reference to which the statements are judged- 


; ; > that 
Scaling methods are described elsewhere, but it should be noted here a 
the analysis of consistencies and inconsistencies in scaling contributes 
knowledge of the judging process and 


» in exchange, knowledge of magni 
phenomena of judgment improves scaling methodology. Furthermore, scel: 
ing procedures make possible the construction of stimulus materials to spec! 
fications for research. 


INDIVIDUAL PREFERENCES 

When someone in a cafeteria c 
else, he makes a decision that ha 
economists for man 
Such a decision ca 
accuracy, but this 
pletely rational a 


hooses apple pie rather than re 
s interested businessmen and theoretic 
y years and should be integrated with other judgments: 
nnot be scored right or wrong so there is no measure ok 
does not mean that all is chaos. Man may not be a com 
nimal, but some consistency appears in his pe 
the story of research in this area is largely the story of a search for a mode 
that will account for whatever consistencies are found in the choices. 

One type of Consistency that argues for rationality is transitivity of ot 
ences. In a rational world, if A is preferred to B, and B is preferred to ©, 
then A is preferred to C, Certainly, if you choose X dollars rather than 
dollars and Y dollars rather than Z dollars, you would choose X i 
rather than Z dollars. If you like apple pie more than blueberry pie sie 
blueberry pie more than custard Pie, consistency requires that you li P 
apple pie more than custard pie. The relevant experiments have been o 
ducted by offering an adult subject sets of alternatives, such as tickets 
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a concert, a tennis tournament, and a play When choices of single indi- 
viduals made from many sets of such alternatives are examined, considerable 
evidence for transitivity has been found (Edwards, 1954). Conditions can 
be arranged that disclose intransitivity of choices, especially when the al- 
ternatives are multidimensional, but even in such cases some order or 
“semiorder” can usually be discovered (Tversky, 1969). 

The high frequency of transitive choices, among simple alternatives, sug- 
gests that in making his choices the subject is expecting to get more of 
something good or, in general psychological terms, is maximizing expected 
value. Going to a tennis match may be independent of going to a concert, 
but when one is forced to choose between them, the relative position of 
each on the judge's scale of values is critical. Choices made as in the 
example above indicate that the judge has a scale of values on which B 
lies between A and C. It is only an ordinal scale, of course; the order from 
low to high value is C, B, A, but the facts do not show that the intervals 
between them are equal. We do not know whether B is closer to C or to A. 

Even with a scale composed of equal intervals, such as a scale of dollars 
on which $7 is equidistant between $6 and $8, the relation to choices is 
Probably not linear. Fechner pointed out that as the magnitude of a stimulus, 
such as a sound or a light, increases arithmetically, the magnitude of the 
Sensation increases only” logarithmically, and, long before that, Bernoulli 
made a parallel suggestion about the value of money. If you have no money 
at all, the offer of $10 will strongly influence your decisions. But if you 

ave $1,000, the offer of $10 more will have a weaker influence. In worldly 
Units $10 dollars is $10; the unit of value is constant. But in human terms, 
the utility declines with the amount at one’s disposal. Hence decision theorists 

ifferentiate between value, in objective terms, and utility, which repre- 
sents the goal value or the influence on the individual’s judgments. 


e relation between utility and value, though curvilinear, is still a 
Jation. In the case of money, the more the 


on some other dimensions, such as en- 


vi : - 

vap mental temperature, give evidence of the operation of an optimal 

i which is preferred to temperatures either higher or lower. Fig. 8.1 
Ustrates another nonmonotonic relation, also related to physiological 


Mecha, oes S $ 
cchanisms, and the next chapter will include similar graphs for attitude 
a. 


telatiyely a 3 
atively simple, monotonic relatio 


etter : 
€r on either scale. Preferences 


INTER 
JUDGE AGREEMENT ‘ 
vcgments are frequently made in @ systematic manner for many prac- 


a and scientific purposes even though no measure of accuracy is avail- 
is and justified by the agreement between judges. The m cb agreement 
a Ome measure of correlation between judges working independently or 
if Count of the number of agreements relative to occasions for agreement. 
a judges do not reach adequate agreement, the project may be abandoned; 

t many studies of personality have demonstrated that, after judges have 


ticed making ratings with the aid of a code book or rating system, good 
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interjudge agreement can be attained. Most reports of such projects merely 
give the measure of interjudge agreement after training, when a satisfactory 
level of agreement has been demonstrated, but the progress toward agree- 
ment can also be examined in more detail. 

In order to plot a learning curve, two graduate students in clinical psy- 
chology (King, Ehrmann, and Johnson, 1952) rated nursery-school children, 
one at a time, for about an hour each, on such dimensions as sociability. 
aggression, and forwardness. Interjudge agreement was low. Then they 
worked together briefly, defining these dimensions and differentiating them 
from irrelevant behavior, and went back to judging independently. After 
several practice sessions together, they were able to rate independently with 
high agreement (see Fig. 8.2). The improvement was due to common 
definition of each dimension to be judged and 
category of the rating scale, and it did not decline after the training ceased. 

One reason for such high agreement is that both judges observed the 
same behavior and maintained a set for the same dimensions. When the 
judges supplemented their observations of the children with background 
data from the children’s records, the agreement was lower, presumably be- 
cause the judges then made divergent inferences about the significance of 
the observed behavior. 3 

In many training situations—for example 
clinics—the important consideration js the 


of the meaning of each 


» in medical and psychological 
agreement between student and 
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FIGURE 8.2. Consistency of judgment and 
observers rating nursery-school child: 
after training. From King, Ehrmann 


learning. The agreement between two 


and Teral classe i ior before and 
, and Johnson ea social behavior bef 
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teacher or between subject and experimenter. To improve means to empha- 
size the dimensions the expert emphasizes and to use the scale categories as 
the expert does. Some experiments on the judgment of solutions to problems 
in the preceding chapter demonstrated how information about the critical 
dimensions of evaluation and practice in using such information brought 
the subjects’ judgments closer to the official ones made by the experimenters. 

An increase in agreement between judges usually indicates an increase 
in accuracy; raising reliability raises validity. But there are exceptions. The 
increase may come from reduction of errors peculiar to each judge, to be 
sure, but it may also come from agreement on a bias or stereotype, from 
domination of one judge by another, and from nonjudicial group processes 
currently under investigation by social psychologists. If four judges practice 
as two pairs and within-pair agreement increases, will between-pair agreement 
increase likewise? Fragmentary research indicates that the between-pair 
increase is much smaller than the within-pair increase, but the conditions 
that influence the outcome have not been thoroughly explored. 


STIMULUS INFORMATION AND ACCURACY OF JUDGMENT — 
van be counted, but this measure 


For some purposes errors of judgment c 
fluctuates widely with the conditions of judgment. In the past two decades, 


Many experiments have employed a more stable measure, the amount of 
information transmitted by a judgment. The measure of information is the 
it, abbreviated from binary digit, which is the logarithm to the base 2 
of the information in the system. Suppose, for example, that the stimuli 
are red and green cards, presented equally often, and they are to be assigned 
to categories R and G. The uncertainty as to which of the two will appear 
8 One f£ 2 is 1. Now if the judgments are correctly made, 
this uncertainty is reduced, and information transmitted is defined as un- 
Certainty reduced, which in this case is one bit. If the stimuli are red, yellow, 
Plue i i sented equally often, the information in the set 
ight objects of judgment the information is 
F ncreased, the information trans- 
t in the information-processing 


bit, since logs o 


is ee ay, green cards, presei 
tiren te C and for a. set of € bjects 
Tee bits. But as the input information is i 
Pies by the judgments approaches a limi 
apacity of the judge. 
Ne experiment requiring judgments of pitch Pollack, 1952) got per- 
ect transmission for 2 pitches Cie 1 bit), 3 pitches, and 4 pitches Cie., 
; bits), but above 5 pitches accuracy fell of. The upper limit was about 
erfect discrimination of five items—whether 
was 6, 7, 8, 10, or 12. For judgments of 
Oudness, the upper limit found by Garner (1953) was 2.1 bits. Apparently 
the limit for judgment of hues is larger, about 8 hues or 3 bits CEriksen 
and Hake, 1955), Several conditions shift this limit over the range from 5 
Son items, but it is fairly constant for physical stimuli that vary on 1 dimen- 
| and may be symbolized by the magical number 7, plus or minus 2 


Min 


iller, 1956). Recent calculations suggest, however, that when the number 


Bite Aas 
Its—which corresponds to p 


ne 
number of stimuli presented 
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of stimuli is increased beyond the number that can be processed with only 
a few errors, the amount of information transmitted may actually de 
(MacRae, 1970). SA 

It is possible to increase accuracy in categorizing a set of blocks of differ- 
ent sizes by coding them, as by painting them different colors. In one ex- 
periment (Eriksen and Hake, 1955), use of any | of 3 dimensions Chue, 
size, or brightness) to code 20 stimuli resulted in the transmisson of 2.75 
bits, but when each stimulus was coded redundantly by 2 of these dimensions, 
the value was 3.43, and when each was coded by 3 dimensions, it was 4.11 
—although the stimulus information in the 20 stimuli remained constant. 

The same kind of analysis has been applied to social judgments by having 
clinicians rate the behavior described in short paragraphs on separate dimen- 
sions such as aggressiveness and dependency. Several values for the informa- 
tion transmitted by these judgments, assembled by Bieri, et al. (1966), vary 
from 1.71 bits to 2.12 bits, somewhat lower than for physical stimuli. 
Attempts to increase information transmitted by redundantly combining 
two types of behavioral information have not vet succeeded. 

The general principle under investigation in these experiments is that 
when a set of objects varies on several dimensions, accuracy of judgment 1$ 
increased by redundancy of coding. Most objects of commerce and industry 
vary in size and color, for example, and sorting errors are less frequent if 
objects of the largest size are all red, the medium ones all yellow, the smallest 
all green, and so on. But this principle is too simple because redundancy can 
take several forms. If we imagine size and color defining a two-dimensional 
space, the coding can be arranged so that the stimuli are scattered throughout 
this space and the distances between neighboring stimuli can be increased 
or decreased even though each size is still redundantly associated with a 


single color. Lockhead (1970) has shown that stimuli manufactured to 
maximize distances in this two-dimensional cognitive space were identified 


ae bie yb than comparable stimuli that minimized distances. The 
subjects of his experiments apparently did not judge separate dimensions 
and combine them; rather, they acquired knowledge of the two-dimensional 


space through experience with the stimuli and then judged them as if they 
were locating each in the cognitive space, i 


_ Furthermore, the redundancy Principle applies only to integral dimen- 
sions. Typically, an object h ining , T hat 
eu ject has an intrinsic hue, size, and brightness; tha 

dund: f TERG as properties of one object. In such cases, 
redun ancy (0) coding Increases accuracy, but when extra dimensions are 
added on, the effect may be different (Garner, 1970). Coding a set of objects 
by redundantly adding Separate features (such as letters or arrows) may 
have an interfering effect, probably because instead of making unitary 


judgments of the objects, the subjects make separate judgments of the 
dimensions. This seems to be the case for abstr: 


š ; : act social judgments in which 
adding redundancy is not effective. 


As Garner points out, one person may 


ease 


is, these dimensions are perceived 


é make Separate judgments of stimuli 
that another person organizes into unitary objects; the flexibility of human 
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intellectual capacities must be considered along with general principles of 
information processing. Highly practiced musicians, artists, and winetasters 
generate new dimensions and treat them as unitary while novices get 
confused by the separate elements. The principles that influence accuracy 
of judgment are of limited generality, but specific data on stimulus informa- 
tion and accuracy of judgments under specific conditions have accumulated 
(Bieri, et al., 1966; Garner, 1962). Accuracy of judgment is also influenced 
by the scale on which the judgments are expressed, as we shall see in the 


next chapter. 


CORRELATION OF RESPONSE SCALE WITH STIMULUS SERIES 

Under some conditions, the association of judgments with stimuli can 
conveniently be measured by the correlation coefficient. Whereas the in- 
formation-processing model treats judgment nonmetrically as the coding 
of discrete stimuli into discrete categories, the correlation model treats judg- 
ment as the adjustment of an ordered scale of response categories to an or- 
dered series of stimuli. The judge attempts to correlate the response scale 
as a whole with the stimulus series. And, if the stimuli are measured in 
objective units such as inches or grams, and the responses are numbered, 
as from 1 to 9, the conventional correlation coefficient measures the accuracy 
of the overall performance. As an illustration of this approach, Fig. 8.3, taken 
from Johnson and Mullally (1969), shows how the correlation between 
judgments and stimuli varies as a function of time of exposure of the 
Stimuli, 


Multidimensional Judgments in Discrete Categories 


s to be judged vary in several 
laim on the judge’s attention. 
always unidimensional when 


Most judgments are complex. The obiect 
A oins; none of which has a unique © 
nd ostensibly simple judgments are not 
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studied empirically. Judgments of the heights of blocks are influenced by 
their widths. The affective dimension biases judgments of formal proposi- 
tions Cas noted in the chapter on reasoning) and even judgments of the 
size of coins. The complexity of judgments of people, often technically 
called person perception, is matched only by the fascination they hold for 
biographers and psychologists. What appears to be a simple judgment on 
a prominent dimension, when taken seriously often turns into a comparison 
of the weights of different dimensions. 

In this diversified world that we are continuously evaluating, the corre- 
lation between stimulus dimensions is a basic condition of the evaluation. 
The experiments on stimulus information mentioned 


objects with perfectly correlated dimensions, 


so that judgments made on 
the basis of hue would be identical with judgments made on the basis of 
size. But that is a very special c 


ase. The judge is more often faced with 
situations in which the dimensions are moderately 


and weight in judgments of persons, and w 
correlation between dimensions is negligible. 

There are three general questions about multidimensional judgements: 
A) What are the dimensions that influence the judgments? (2) Tow much 
weight does the judge give to cach? Weighting dimensions differentially 
implies refined quantitative judgment, as compared to the concept experi- 
ment, in which the distinction between relevant and irrelevant dimensions 
is more clear-cut. And (3) How are these effects combined in the judging 
process? Individual differences between judges can be expected in the 


; i vell as the others, and a judge may shift in 
any respect as his experience accumulates, i i 


When the judgments are simple and the d 
approach, such as the tabulation of judon 
various conditions, may be the most suitabl 
of the principal features of the 
dimensional judgments have 
have become both more 


above used stimulus 


correlated, such as height 
ith situations in which the 


ata imprecise, a crude empirical 
nents of various stimuli under 
utable for a preliminary description 
Operation. But i 
attracted more rese 
Sker eis eras and 
chologists have preferred to begin wi 7 i j x4 
activities Presen iive or nötnate With a hypothetical model of the judges 

S prescribe, like the norms of logic» 


in recent years, as multi- 
arch attention and the data 
more interdependent, many PSY” 


what the judge should do, while deserts, é ‘ 
te ae J A oh ee e descriptive models attempt to describe what 
the judge actually does—and some models have both functions. Most 
‘inca form, and judgment data Cmore 
or example) have the precision neces” 
: model, 

The elaboration of judgment models has not t 
not unified the field of judgment. Most models 
situations in which they were developed. Furthe 
are not often comparable because they lead t 
poe eee eae ache: measures. They all have in common, 
however, some respect for the rationality of the juc 
he can analyze, and attend to, different attribu 


models have been stated in mathem 
often than problem-solving data, 
sary for evaluating a mathematical 


aken over and certainly has 
are applicable only to those 
tTmore, models for judgment 
o different experimental ar- 


judge in that they assume 
tes of the stimulus objects 
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and that he tries to fit these to the response categories in some consistent 
fashion within the restrictions specified by the instructions. 

Although the grand psychological theories took shape in other areas, the 
field of multidimensional judgments is an excellent arena for psychological 
controversies. All the classical antinomies are here: the subjective versus the 
objective, rationality versus mechanism, the accumulation of empirical facts 
versus model building, addition versus configuration. The experiments in- 
volve adult human subjects thinking about matters of some significance, yet 
the data can be collected with considerable precision. 

- It will be convenient to discuss judgments in discrete categories first, then 
judgments on a continuum. 


SERIAL PROCEDURES 

The serial nature of many judgments should be mentioned first because 
this is a common way of organizing heterogeneous data into a final decision. 
Consider the case of a grocer debating whether to expand his business. He 
may first raise the question: Can I increase sales volume? If his best guess 
is No, the matter is settled. If he can say Yes, he goes on to the next hurdle: 
Can I acquire more space? If the answer is No, the matter is settled at this 
Point. If this second hurdle is surmounted, he goes on to the third: Can I 
finance the addition? And so on. The overall enterprise may be viewed as a 
“ecision tree made up of branches leading to other branches. 

When the data are all available at once, the judge has a choice between a 
Simultaneous and a serial strategy, but in some situations the data are col- 
lected serially as a consequence of prior decisions. The evaluation of appli- 
cants for a job or a scholarship, as well as medical and psychological diag- 
nosis, often proceed by a succession of decisions, each of which influences the 
collection of more information for the next decision. The employment official 
reads a letter of application and tosses it in the wastebasket or requests the 
applicant to take a test. If the test score is below a critical point, the appli- 
Cation is finished; but if it is above the critical point, he is invited for an 
Interview, 

Abie" this procedure a series of categorical decisi ae a 
are in discrete categories or that critical scores di i 
Avorable and unfavorable categories. It is true that at the final summing up 
: € judge may examine all the data on hand and that distance above any 
Critical score may enter into the final evaluation. But it is also true that the 
information available depends on prior decisions. In medical diagnosis, the 
Number of possible tests is so large and the information resulting from them 
50 variable that the decision which to recommend is as critical as the evalua- 
tion of the test results. The rationale is one of efficiency; but the most efficient 
Sequence in these cases, as in information seeking in general, 197a Jort func- 
Poss of information and cost. Thus, a m for pa p Su 
com, ; : vell as evaluation of the results. In 
other re co 7 S Ls ae com the human judge and the com- 

o, the optimal linkage bety 


Puter isa serial one (Shelly and Bryan, 1964). 


ons assumes that the 
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This description of serial judgment does not explain how any one decision 
is made, but it does show where each fits into the larger enterprise. 


JUDGMENT MODELS FOR CLINICAL DIAGNOSIS 


A SYLLOGISTIC MODEL 

Since clinical psychologists often assign patients to diagnostic categories 
and make categorical statements about them on the basis of multiple cues, 
they have a special interest in multidimensional judgments. In discussing the 


problem of clinical inference, Sarbin, Taft, and Bailey (1960) suggest a 
syllogistic model. 


All fulminating schizophrenics are dangerous; 
This man is a fulminating schizophrenic; 
Therefore, this man is dangerous [p. 52] 


The major premise is a generalization derived from reading, experience, 


and implicit theories, while the minor premise comes from examination of 
the individual. Within this structure the familiar logical fallacies can be 
identified, as well as valid conclusions. z 

Probabilities may be included in the major premise, such as: The frequency 
of association between improvement in 


; psychotherapy and intact ego, mil 
anxiety, and high motivation is six in te 


n. If the clinician assumes that this 
he operates with it as his major 
» he states a minor premise, such as: 


i 5 ons characterized by intact ego, mild 
anxiety, and high motivation. In formal logic there is no valid conclusion to 


this syllogism because it is not known whether Jones belongs to the 60 per- 
cent who improve or the 40 percent who do not. But if a decision must be 


made, it can be stated in two ways. The probability that Jones will improve 
k E as ” 
in therapy 1s SIX In ten, or the statement “Jones will improve in therapy 
has a credibility of 0.6. 


percepts, for example, makes a diffe 


the number of percepts is large than when it is small. Configural interpreta 
tion is more difficult to master, of course, than simple counts of Rorschach 


DECISION RULES 


The information-processing approach to clinical judgment is exemplified 
in a preliminary study by Kleinmuntz (1968) of an expert clinician’s use 


of information from the Minnesota Multiphasic Personality Inventory. The 
expert was given the task of Sorting the test profiles of ie college students 
into 2 categories, adjusted and maladjusted, a task that consumed 60 hours: 
and to “think aloud” into a tape recorder while doing so. From records © 
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the classifications and accompanying comments, Kleinmuntz was able to 
formulate 16 decision rules that the judge had apparently been using. One 
tule consisted simply of addition of signs of maladjustment: If the scores on 
4 or more clinical scales exceed 70, call the student maladjusted. Other de- 
cision rules were more complex, including interactions between signs: Call 
the student maladjusted if Pa exceeds 70, unless Mt is less than 6 and K is 
more than 65. Clinicians often generalize such rules for themselves and, if 
they are explicitly stated, they can be communicated to others. In fact, 
Kleinmuntz undertook this search for explicit decision rules as a first step 
toward programming a computer to make the diagnoses. 

Do clinical psychologists actually make clinical judgments according to the 
sophisticated models that the experts advocate? This is a case in which think- 
ing makes it so; a psychologist is likely to adopt a model that he accepted 
when he was exposed to it as a student. Students follow their leaders, as a 
tule; but if the leader's path is not well marked or if it seems unnecessarily 
circuitous, the student may take an easier path—perhaps additive rather 
than configural. The question whether clinical signs are combined by addi- 
tive or configural rules will come up again later in this chapter for judgments 
made on continuous scales. 


THE DISCRIMINANT FUNCTION 
hat different when he is required to classify 
dmit and don’t admit or hire and don't 
ire—on the basis of multiple dimensions, with only the restriction that the 
same number be assigned to each. People differ in many dimensions but 
they do not come in two types. This task is not like the concept attainment 
task that requires discovery of the critical binary attributes, therefore, but 
More like the sorting task that requires the subject to “put those objects to- 
ether that belong pebas” As on the sorting task, different subjects may 
© expected to emphasize different dimensions. There will be some overlap 
“tween the categories since certain individuals could be assigned to either, 
Ut it is reasonable to assume that each judge will try to maximize the dif- 


e : 
tence between the two categories. 
he discriminant function was de 


The Judge’s task is somew 
People into two categories—such as 4 


vised to solve statistical problems of this 


kind, so Rodwan and Hake (1964) raised the question whether this statistical 
model would describe what serious judges actually do when they combine 
the information on several dimensions to make two-category judgments. The 
“scriminant function resembles the multiple regression equation. Whereas 
the latter maximizes the correlation between the weighted sum of the 
Predictor variables and the criterion, the discriminant function is a weighted 
Sum that maximizes the discrimination between the two classes. Both pro- 


cedures yj : ioh 

yield a set of optimal weights: 

© test this model ae and Hake used 81 photographs of schematic 
; th of nose, Xı; length of chin, X»; 


a s 5 
ices that varied in 4 dimensions: leng 
istance between eyes, Xs; and length of forehead, X4. There were 3 values 


of each dimension. but other lines in the faces were constant. (The resem- 
A 
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blance between these materials and those used in conventional experiments 
on synthetic concepts should be apparent.) The subject's task was to judge 
each face intelligent or not intelligent. For the data of one subject, told that 
half the faces were supposed to be intelligent, the discriminant function 
yielded these weights: —.21X; +.62X, —.07X, +.76X,; that is, this sub- 
ject gave a small negative weight to length of nose, a strong positive weight to 
length of chin, and an even stronger positive weight to length of forehead, 
while practically ignoring distance between the eyes. For another subject, 
the weights were: +.09X, +.31X» —.22X, +.92X,. Three men and 1 woman 
weighted length of forehead, X,, positively, but 2 women weighted this 
dimension negatively, perhaps as a reaction against the common stereotype 
Evidence for the validity of the model is that the equations reproduce the 
obtained judgments with small error. Another test varied the restrictions on 
the number in each category. The judges were told, when given a third of 
the faces to judge, that 25 percent were supposed to be intelligent; then, 
when given the next third to judge, that 50 percent were supposed to be in- 
telligent; and for the other third, 75 percent. The model requires that each 
subject weight each dimension the same regardless of these percentages, and 
the data indicated that they did so, within narrow limts. Furthermore, the 
weights were stable over a three-week period. 
A realistic feature of this model is that it allows for differences betwee? 
judges in the weights given to the four dimensions—some actually were 
ona zero—but they were stable under different conditions. The model as- 
sumes that the contributions of the different dimensions to the categoriza- 


nits were linear and additive, and the fit to the data indicates that the 
Judges operated approximately in this fashion. 


DECISIONS ABOUT ACTIONS WITH UNCERTAIN OUTCOMES 

It is reasonable to assume that w 
apple pie and custard pie, he is sim 
jective value, or utility, 
cide between alternativ 


hen a person makes a choice betwee? 
3 ply choosing the one that has more su?” 
fo him. Risky choices, which force the person to de- 
each, requir ss a when he is not certain of the outcome ° 
eh, require a more complex decision model. Consider the decision, before 
going ret sane to wear a raincoat or not when the weather man has given 
the probability i 
oei a. be i eerie four outcomes to be considered are = n 
(positi 3 nce the model requires estimates of uti 1y 
positive or negative) for all outcomes, we begin with the worst outcome, the 
disaster of being caught in the rain witho! fa oe oe utility 
of —100 (lower left cell). On this scale wee ee and give it doe 
i | n this scale, wearing a raincoat when it does 
not rain would be a minor nuisance, with a utility of —20 Cupper right 
cell). Wearing a raincoat when it does Tain miglit he er sma 


triumph of good planning, worth brief mention at lunch, so we give jita 
utility of 10. Not wearing a raincoat y £ 


i hen it d zi i utine 
i i res oes n s the ro 
state of affairs, neither positive or negative ot rain i 


A virtue of this model is that it takes into account the probabilities and 
utilities of all four outcomes. Since the probability of rain is .3 and 
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TABLE 8.1. MATRIX OF ALTERNATIVE ACTIONS AND OUTCOMES 
TO ILLUSTRATE A DECISION MODEL. THE PROBABILITY 
OF RAIN IS ASSUMED TO BE .3; THE CELL ENTRIES 


ARE SUBJECTIVE UTILITIES. 


Outcomes 
ee 
Rain No rain 
p=3 = 7 
i 
Actions Wear raincoat 10 —20 
No raincoat —100 0 


utility of wearing a raincoat in this event is 10, the expected utility of this 
Outcome is .3(10) or 3. To this must be added the expected utility of wear- 
ng a raincoat if it does not rain, which is .7(—20) or —14. Therefore, if 
the decision is to wear a raincoat, the sum of the expected utilities is — 11. 
Symbolically: EU = p,U, + p2U2. The same computations for the decision 
Not to wear a raincoat yield an expected utility of —30. All things considered, 
the model says that not wearing a raincoat is probably worse than wearing 
one, Wearing one would be the judicious thing to do. There may be more 
alternative actions to choose from and more probable outcomes to consider, 
ut this simplified illustration outlines the structure of the model. 

Actually, it is difficult to get good quantitative estimates of the utility of 
Outcomes, other than money outcomes, which is why many of the investiga- 
tions of the psychology of decision making are concerned with a simpler 
Model of expecte d valne EV = pV—the expected value of an event is the 
Monetary value of the event multiplied by the probability that it will occur. 

UPpose you are offered the following gamble: Toss a half-dollar coin in the 
air and if it comes down heads you win the coin. The value of the coin is 

cents, and the probability that it will come down heads is .5, hence the 
Expected value of the gamble is 25 cents. If you are asked to pay 30 cents 
oat this game, the price is too high. Naw suppa ei ad pa anather 
e: Pick a card from an ordinary deck of playing cards an if it is a 
Spade you win $2 Compute the expected value of the gamble. If the two 
alternative shies were offered to you at the same price, which would you 
Prefer? The expected-value model states that the gamble with the highest 
SXpected value will be preferred. In cases where a loss (symbolized by L) 
va Well as a win (W) is probable, the decision model states that the optimal 
tTategy is to maximize the quantity: pwVw — pV 5 
ecision models of this nature are being used to organize the voluminous 
ata involved in hich-level economic and military decisions. The question for 
Psychology is rave well the model describes the decision-making processes 
; ordinary educated adults. From a broad perspective this question is 
lg te quo ked in A OYAN eNaP e e sene dn A 
ee scribe the reasoning of ordinary . : : clear, thus 
> that the fit to human decision making is close enough that the model 
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is a good starting place from which to examine the discrepancies from it and 
to try to improve it. At least it is better than the assumption that decisions 
are determined by random behavior, or nonrational associations, or naive wish 
fulfillment. 

From the psychological point of view, decisions under uncertainty are 
multidimensional judgments involving at least two dimensions—probability 
and utility Cor subjective value). The basic assumption that these are joined 
in a multiplicative combination is a reasonable one, as a starter. But many 
people do not understand probabilities very well, and the utilities of outcomes 
are personal, affective magnitudes. Hence, theoretical analysis and empirical 
research have been directed toward the quantification of ‘these two dimen- 
sions as people actually employ them (Becker and McClintock, 1967). 


ESTIMATING PROBABILITIES 
Most adults estimate probabilities from a logical analysis of the alternative 


events or from observation of frequencies of past events of the same class 
tolerably well, but certain constant errors have 


been observed both in analyses 
of bets at racetracks and in laboratory 


5 experiments. Low probabilities are 
often overestimated and high probabilities underestimated, perhaps because 


of a regression effect that will be discussed in the next chapter. Another 
common error is the so-called Monte Carlo fall 
of independent events of one kind (eg., gi 
probability for a recurrence of that event ma 


native Ce.g., a boy baby). This fallacy probably has its origin in card games- 
Withdrawing a red card from a deck of 26 red and 26 black cards, without 
replacement, decreases the number of red cards i 
of drawing a black card next time. The 
to the transfer of experience from a close: 
open system of independent events—wh 
(1960) traced the development of sucl 
concluded that “no one guesses in a ma 


the a i of 6 to the very bright adolescent of 16, predictions follow 
complex system of pattern-seeking, associated with preferences for aesthetie 
Sen or balance, fairness, and beliefs in magic [p. 36].” 

The negative implication i j tines i 
> A . oe is awkward in making decisions under risk, 3 

me ing inferences rom categorical propositions; the probability that an 
event will occur and that its complement will not occur do not always sum 
to unity, as conventional probability theories require. Sums different from 
unity as in more common when there are more than ine probable even 
5 : 
tc vy) er. It : true that many otherwise intelligent people concentrate Ga 
A pa and are not impressed by statistica] notions of longrun PFO” 

JEVI 1 + z E 

ability. In general, however, subjective probability is roughly related tO ob 
jective probability—though the relation js usually not linear and is somewh4 
dependent on the decision situation. k 

After reviewing a large number of e 
students with samples of data and req 


acy, the tendency after a run 
rl babies) to estimate a low 
l higher probability for an alter- 


and increases the probability 
Monte Carlo error is probably due 
d system of dependent events tO an 
ich occurs also in reasoning. Cohen 
h predictions in schoolchildren and 
From 
sa 


nner without rhyme or reason. 


ege 


Xperiments which presented coll es 


uired them to estimate probabilit! 


MULTIDIMENSIONAL JUDGMENTS IN DISCRETE CATEGORIES ‘ 357 
Proportions, means, variances, correlations, and other parameters, Peterson 
and Beach (1967) concluded that man is a fairly good “intuitive statistician.” 
The most frequent discrepancy from the statistical model was in the conserva- 
tive direction. The subjects failed to extract all the information in the 
samples that they observed and thus hesitated to make extreme estimates. 
But by and large their inferences approximated those that would be made 
by the appropriate statistical model. Hence the students would be entitled to 
a grade of A— for making reasonably optimal decisions, and the statistical 
models would also be entitled to a good grade for moderate success in de- 
scribing human behavior under the uncertainties of a probabilistic environ- 
ment, 


BETTING PREFERENCES 

The utilities of such outcomes as going to a concert, getting caught in the 
rain, and having baby girls are difficult to quantify, hence much of the re- 
search has dealt with the utility of money. One procedure is to give the sub- 
ject a choice of bets, or gambles, as in the example above. A simpler pro- 
cedure, like a lottery, has the subject purchase a gamble: “You can win three 
dollars if you Gane + spade from a pack of playing cards. What will you pay 
for this chance?” Following this procedure, instead of the two-category de- 
Cision, Komorita (1964) had college students buy many gambles and found 
that power functions—characteristic of many psychological magnitudes—for 
both probability and utility accounted for the data quite well. The subjective 
distance, which is what counts in making these judgments, from a probability 
of .25 to one of .50 was much greater than from .50 to .75. For the boys, the 
utility function of money was practically linear with dollar value, but for 
the girls the utility of the larger sums was relatively smaller. The good fit to 
the data indicated that the subjects were reasonable, each in his or her own 
Way, in basing their decisions on the objective probabilities and the money 
Values, z 

Since decision experiments need an adequate sample of bets, they extend 
Over considerable time: yet the shifts in preference for bets during an experi- 
ment have been surprisingly small. Stable strategies of evaluating the situation 
and choosing bets presumably reflect previous learning and thinking in similar 
situations. The experimenter can manipulate the payoff to the players, how- 
ever, if he has sufficient capital. By this maneuver, Greenberg and Weiner 
(1966) found that the preferences of the bettors changed with the number 
Of wins and losses. Those who had as many wins as losses became more con- 
Servative in their betting than those who had either high or low reward 
ratios. This is understandable when we genic that a es oe like 
anythj ; rsonal, even boring, task. en the player 
a ects onan see A this adds not only information about 


: Il (Meyer, 1967). When col- 


the system but a motivational increment as well (I i 

ege students are overworked by repetitive decisions about gambles, they 
often descend to a simple strategy such as always choosing the gamble with 
“ae higher probability of winning or always minimizing the maximum loss, 
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À ie ees r 

even though they are capable of following a more complex strategy unde 
&. ed . . » pj a 5 A 

better working conditions (Slovic, Lichtenstein, and Edwards, 1965) 


DEVIATIONS FROM THE DECISION MODEL Shee fairly 

Since the expected utility, or the expected value, model describes ray 
well what the average adult does under certain conditions, the conditions 
under which the model does not fit are particularly instructive for the psy- 
chology of judgment. For one thing, the average adult does not always nid 
mize expected utility by considering all possible outcomes. Everyone knows 
when buying insurance that the insurance company takes a percentage to 
expenses and profit—though the magnitude of this percentage is not generally 
appreciated—so the expected value to the buyer, if properly calculated. must 
be negative. But people do buy insurance, following what is called a minimax 
strategy. The buyer considers only the maximum loss from a single catas- 
trophe and tries to minimize it. 

Like the insurance buyer, the average amateur gambler knows that the 
house takes its cut but he also does not maximize expected utility in the pre- 
dicted way. Either he focuses on one outcome and ignores the others oF 
gambling has a utility of its own that should be entered in the equations: 
When Cohen (1960) offered English schoolchildren a choice of a small 
prize or a chance of winning a large one, the majority preferred to take a 
chance. At age nine about 94 percent preferred to gamble, at age fifteen only 
54 percent. Perhaps shopping has a utility of its own for some people: at 
lense, many merchandising schemes operate on the assumption that the cur 
tomer wants entertainment in the form of a raffle or a plus value outside the 


y valu is willi . Š sa aking 
money value and is willing to pay extra for it. Even rational decision maki § 
may have a utility of its ow jon 


theorists of this particular de 

The complexity 
problem situations, 
more likely 


n; practical businessmen have accused decis 
viation from rationality. ' 
of the situation is an important factor here as in aher 
College students choosing between alternative bets a 
; to make choices that fit the expected value model, and thus ate 
optimal choices, if the bets are simple ones that are casily understood: : 
expected value model does not fit so well when the hen are complicate“: 
Choices between hypothetical gambles are not the same as choices betwee” 
gambles that must be played (Slovic, 1969b), And the Frequency of winning 
makes a difference even when expected lua is held oti Millet 


; is a 
Meyer, and Lanzetta, 1969). Even though the expected value model pA 5 
fairly good first approximation, the discri e 


e a ¢ too 
i ancies ar arge and 
quent to ignore. P ies are too la g 


If we treat the gamble as a multidimensional judament, four d 
must be considered: the value of a win, tH : PER Ae f in 
; Na » the probability of a win, 
of a loss, and the probability of a loss, It is mathematically reasonabl p 
sume that values are multiplied by probabilities; but instead of E ia 
on, one can, in a more empirical way» ask stein 
nto the overall Judgment. Slovic and Lichter” 4 
estion, not by asking their subjects to pins 


‘ons 
imensio? 
the valu 
e to as” 


with this normative assumpti 
these four dimensions enter j 
(1969) approached this qu 
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tween gambles, but by asking them to rate the attractiveness of each. Then 
they used correlation procedures to determine the weight attached to each 
dimension. This correlation model of decisions under uncertainty will be 
discussed along with other such models in the next section, but the large 
individual differences that turned up should be mentioned here. Some sub- 
jects put the heaviest weight on the probability of winning, others on the 
probability of losing, and others again on the amount that could be lost. 

That some individuals like to gamble more than others can be readily ob- 
served, but laboratory arrangements make it possible to observe and analyze 
the differences between individuals in more specific aspects of decision 
making. Kogan and Wallach (1964), for example, have analyzed the choices 
of 200 men and women for various strategies—such as maximizing gain and 
minimizing loss, their preferences for long odds or even bets, and post- 
decision satisfaction—both in hypothetical situations and with actual pay- 
offs, These individual aspects of risk taking are interrelated, as one might 
expect, and are also related to personality measures, such as defensiveness 
and anxiety. 

How rational is the ordinary intelligent adult 
tainties of the future? The question, as such, is too vague. We can inquire, 
closely his choices approach an ex- 
1 certainty. The research 
model is difficult to test 
difficult but also 


in dealing with the uncer- 


however, for specific conditions, how 
plicit model of rational decision making under un 
summarized above indicates that the expected utility 
not only because the measurement of utilities is technically 
because of the variety of utilities that people expect. In experiments on 
gambling (where expected utility is expected monetary value), it appears 
that college students follow the rational model and make optimal decisions 
fairly well within the limits of their comprehension of the gambles. When 
the gambles are complicated and especially when they are also boring, stu- 
dents are unable or unwilling to follow the details. They adopt some simpler, 
e their decisions appear less optimal— 


More congenial strategy, which makes ar | 
i if not from the subject's. Why work 


rom the experimenter’s point of view 
So hard just to appear rational? f 
In general, the deviations from the optimal expected value model involve 
weighting and strategy. The model assumes that the value of a win is 
Weighted Çi.e., multiplied) by its probability, that the value of a loss is 
Weighted by its probability, and that the intelligent adult follows a single- 
minded strategy of maximizing pwVw — pıVı. But the data show that the 
value of a ‘win, or a loss, is often rated higher or lower than its probability 
and that the probability of a win, or a loss, may have a utility that is more 
than just as a multiplier of value. The strategy may be simply to minimize 
the big loss, or to maximize the number of wins, or even to choose the 
gamble that is easiest to understand. The subject oversimplifies the cognitive 
Structure by focusing on one or two elements. Thus, the question of ra- 
i zi der uncertainty has some parallels with 


toale « ce 

rep in the case of decisions un 1 i 3 

ne same question in the case of logical reasoning with categorical proposi- 
tions 
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Multidimensional Judgments on Continuous Scales 


When judgments of multidimensional objects are made on continuous 
scales, as from 1 to 7 or from extremely favorable to extremely unfavorable, 
hypotheses about combinatory processes can be examined with some pre- 
cision. There is no single normative model that prescribes how a rational man 
should proceed in this situation—except in the general sense that his judg- 
ments should be adapted to reality—but the research has been guided by the 
pervasive psychological controversy between addition and configuration, and 
the models have been designed and revised in close contact with empirical 
data. Those cognitive models that include the organization and consistency 
of motives, beliefs, and actions depart from the main concern of this book 
and have been well reviewed in books on social psychology. The emphasis 
here is on attempts to describe the judging process in general, and some 
generalizations have emerged from judgments of people, words, photographs, 
behavior symptoms, occupations, foods, and other worthwhile objects of study. 


THE SUMMATION MODEL: BREAKDOWN AND REPAIR 


One of the earliest quantitative studies of the additivity question illustrates 
a method that was widely adopted in subsequent decades and anticipated the 
trend of more recent results, Washburn, Haight, and Regensburg (1921) 
presented cards with single colors and cards combining two colors for their 
subjects to rate as to pleasantness on a seven-point scale. They obtained a 
correlation of .75 between the ratings of the combinations and the sums 
of the ratings of the single colors. Thus the experimenters suggested that in 
rating the combinations, the subjects added their ratings of the constituent 
single colors. But 15 percent of the disagreeable colors made pairs that were 
judged agreeable. Hence something more than simple addition must also Þe 
considered. When Beebe-Center (1932) reviewed the early evidence on this 
question, under the name of hedonic summation, he noted the frequency ° 


simple algebraic summation and noted also that in some cases the result © 
juxtaposing the separate stimuli was not merely a combination, but a new 
configuration. i 

A favorite illustration of Gestalt 


; psychologists is that when one perceives 
a face, even briefly, he gets a distinc 


ble fi t impression of the whole which comprises 
many separable features but also perceives the features as part of the whole. 


Faces are difficult to manipulate experimentally, except in the synthetic 
manner (see page 353) of Rodwan and Hake (1964) oo Asch (1946) ar 
ranged adjectives referring to personality traits in different lists in order t0 
illustrate how the reader organizes a unified impression of an individual and 
how the whole list influences the contributions of the constitutent words. 
What this has to do with the Perception of warm, moist peo le is disputable, 
and words may not be the best medium for E AE Gestalt prin- 


ciples of perception. But lists of words can easily be assembled for judgment 


separately and in combination. Thus this type of experiment, under the name 
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of impression formation, has supplied considerable evidence on combinatory 
processes in judgment. The question of generality will come up in connec- 
tion with judgments of other materials. 

When discrete words, or other items, are judged separately and in com- 
bination, on a prominent dimension like attractiveness, a hypothetical process 
of algebraic summation of pros and cons is intuitively reasonable. Or, if the 
judge does not add his ratings of the components, he may average them. The 
summation model and its variant, the averaging model, are similar in that they 
both assume that the judge operates additively and they both make the same 
predictions about correlations between ratings of components and ratings 
of combinations. The data from many experiments of the past fifteen years 
agree that one or the other accounts for a large share of the variance. More 
recent experiments have attempted to distinguish averaging from adding, to 
diagnose the weaknesses of such models, and to supplement them with cor- 
rections that will account for a larger proportion of the variance. 


WEIGHTING OF COMPONENTS ` 
One obvious possibility is that the judge attaches more importance to one 


component than to others, hence the model should weight the components 
correspondingly. The congruity model of Osgood, Suci, and Tannenbaum 
(1957), in order to account for ratings of combinations of adjectives and 
nouns such as hairy scientists, added the separate ratings of each weighted 
in accordance with their deviations from neutrality on a bipolar dimension. 
But when Triandis and Fishbein (1963) got ratings of individuals described 
by four characteristics (race, occupation, nationality, and religion), the sums 
of the ratings of the components, without differential weighting, correlated 
More closely with the ratings of the composites. 

_ To gain more eee prediction, hel and Rothman (1965) asked each 
judge how much weight he attached to the adjective and to the noun in 
rating combinations of these, and then weighted the components of their 
additive model correspondingly. Thus they could predict the ratings of com- 
Dinations quite well for those judges who were able to follow the detailed 
instructions, When items of the same type, such as a set of foods constituting 
a meal, are rated, the first items have somewhat more influence on the rating 
of the whole, and appropriate weighting for this primacy effect improves the 
Predictions (Anderson, 1965a; Anderson and Norman, 1964). This is not an 
unusual situation because the final judgment of a complex object is often 
tegrated from serial judgments of the components. Differential weighting 
is indicated also when the separate items of information come from sources 
that differ in credibility (Rosenbaum and Levin, 1969). Regardless of the 
Model chosen to reproduce the judge’s operations, the possibility of differ- 
ential weighting specific to the situation and even to the individual judge 


cannot be overlooked. 


CONTRAST AND EXTREMENESS 


curious discrepancy in the avera 


ging model turned up in a careful 
analysis by Willis (1960) of judgments of college students’ photographs, 
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arranged in contrasting sets of two or three extremely Tei ant Pa a 
three extremely unattractive individuals. The important finding w 4 ee 
tremeness of the judgments of the mean attractiveness of the sets relativ hi 
the mean of the scale values of the components. Overall, the pe in 
values of the components, on a scale of nine categories, were 215 ats = 
but the judgments of the sets were 2.02 and 4.58. The differences may 
small but they were consistent in several variations of the sets. 
The enhancement of contrast discovered by Willis could be due Aa 
tendency on the part of the judges to stereotype the two groups, but be 
(1963b) uncovered the same effect at less extreme positions. He asain ee 
many sets of 3 opinion statements, spaced along an I]-category scale, Ta 
instructed the judges that each set represented one person's pinion ta ad 
judged as a whole. For triplets with mean scale values of 2.87, 4.27, Her 
9.68, the judgments of the composites were 1.15, 3.80, 9.62, and l T 
Similar extremeness has also appeared in judgments of persons dehe 
excerpts from letters of recommendation in sets of 1, 2, and 4 (Brewer, ri he 
The interpretation of extreme ratings is awkward because of the po he 
of unequal scale intervals at the extremes, especially if the ratings ° see 
component items and of the combinations are made by different juc oe 
Therefore, Manis, Gleason, and Dawes (1966) had the same judges ae 
single statements about fraternities and pairs of statements Cattributed F H 
person), and used a simple ordinal measure of extremeness: the aa Y he 
judgments of the pairs that were more extreme than their components. id 
pairs could be judged more extreme or more neutral than their ae ple 
but the Preponderant deviation was toward the extremes. Another ae 
procedure used by these experimenters presented three statements attri? 


š sented the 
to one person and asked the judge to select the one that best represen gst €X 
F; S č $ a OS 
person's attitude. The hypothesis of extremeness predicts that the m 
treme statement would 


>, oF 
sxtreme, í 
be selected more often than the least extr 
most neutral, statemen 


ts, and this prediction also was confirmed. 
The extremeness effect is embarr 


i 3 , no 
assing for a simple averaging ™ 

summation model would not be chal 
binations are seldom as extreme as si 


three or four components are summed. Apparently, the judges strike 4 
promise between the mean and the sum of all components. Manis, 
and Dawes su 


ae 
luded ggest a weighted average in which all components as the 
clude but the effect of the number of components diminishes °> 
logarithm of the number, 

Meee AND ADDING if the 

athematically, an addi i dal 3 
in a mo 

weights attached to th 8 model becomes an averaging at hand 
Aar © the components sum to unity. Since the data ° ent © 
: f = some support for both, Anderson (1965b) ‘designed an ae a 

i engte the two models, He used an unbounded scale since a P? dding 
scale, such as l to 7, could Testrict the Tange and thus be unfair to ne i 
model. Beginning with a stand 


; 1 F rances 
ardized list of adjectives rated in ad 
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drew some rated highly favorable CH) (eg., reasonable and truthful), some 
rated moderately favorable (M+) Cegn painstaking and persuasive), some 
rated moderately unfavorable (M—) (eg., unpopular and dependent), and 
some rated highly unfavorable CL) Ceg.» spiteful and abusive). These were 
presented in sets of two and sets of four to college students, with instructions 
to rate the person described by these sets of adjectives on an unbounded 
scale centered at 50. They were told to use 50 for a person they would neither 
like nor dislike, to use lower numbers for persons they would dislike, and 
higher numbers for persons they would like. Means are shown in Table 8.2. 

The argument for averaging is the fact that the mean for HHM+M+ 
falls between the means for HH and M+M+ and the mean for 
LLM~M-~— falls between LL and M—M-. The argument for additivity 
is that the mean for HHHH is higher than the mean for HH and that 
LLLL is lower than LL. But the mean for HHHH is not twice that for 
HH and the subjects had preliminary practice with the scale so that they 
could space out their ratings if they wished. Anderson therefore accounts for 
this discrepancy from the averaging model by 
assumed to be at the middle of the scale. If the overall impression is the 
average of all these, HHHH plus 50 will average out to a value somewhat 
higher than HH plus 50; likewise, LLLL plus 50 will average a little lower 
than LL plus 50. 

Podell and Amster (1966) also found, when they used one, three, and six 
deviated more from the neutral 


To explore this effect further, 


invoking an initial impression, 


adjectives, that the composite judgment 
Point as the number of adjectives increased. 
Anderson (1967) made up sets of one, two, three, four, six, and nine ad- 
Jectives. An averaging model which includes the initial impression Cassumed 
to be at the neutral point of the scale) and the scale value of each com- 
Ponent, with diminishing weight as the number increases, predicted the 
Composite judgments quite well. One system of weights, computed for each 
Judge individually, was sufficient for all set sizes. 

At this point there seems to be some agreement that, in these situations 
here the components are discrete and independent, the judge does combine 
the information from each component in a weighted average, not as ex- 


Ww. 


TABLE G2, mpar arigos or sets OF THO ANP BONA ADJECTIVES 


Set Type Response Set Type Repone 
m 72.85 HHHH Pr 
ong M+M+M+M-+ 63.20 
oa 57.56 
oe 42.18 M—M—M—M— 56 56 
“ 23.70 LLLL ae 
HHM+M+ A 
LLM—M— pe 


F 
tom Anderson (1965b). 
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treme as the sum of all components nor as neutral as the simple ee te 
Judges are influenced by the number of items of information, but this m a 
ence is a decreasing function of the number. The nature of the camponer 
must also be considered, perhaps by an adjustment of the weights, parmi 
when similar experiments were conducted with sets of gifts, traflic violations, 
and behavioral symptoms, the data for gifts came closest to the P 
of a summation model, as one might expect, while the data for traffic ie ik 
tions came closest to an averaging model (Hicks and Campbell, 1303); 
Judgments of the social class of various individuals, based on three D 
ponents (occupation, income, and education), fit an averagaing model qi i 
well (Himmelfarb and Senn, 1969), and so did evaluative judgments o 
“little-known” groups of people described by taped travelogues containing 
positive and negative adjectives (Bossart and Di Vesta, 1966). 


INTERACTION EFFECTS , Cece 
Interaction effects are embarrassing to adding and averaging models, an 
such effects do show up occasionally in analysis of the variance of the judg: 
ments. When students expressed their preferences for working in four oc- 
cupations in four cities, most composite judgments could be BARE 
an additive combination; but the attractiveness of teaching interacted with 
the attractiveness of cities (Sidowski and Anderson, 1967). This means that 
the attractiveness of teaching in Clevel 
attractiveness of teaching in gencral 
the addition of these two m 
Teaching in Clevel 


and, for example, depended on the 
and of Cleveland in general but that 
ain effects did not account for all the Ere 
and might be judged as unusually enjoyable, as if the 


‘ : ; mee ther than added. 
attractiveness of these two dimensions were multiplied rather than a 
And, as another example, when 


dents were combined and j 
effects were strong 
Moving from th 


adjectives and photographs of college ren 
udged as to desirability on a date, interactio! 
CLampel and Anderson, 1968). ead 
e familiar psychological laboratory to an arena that rr 
chologists seldom enter, Slovic (1969a) found intriguing signs of interáctuon 
effects in the stock market. He persuaded 2 experienced stockbrokers to i 
the growth potential of 128 companies on the basis of 11 factors—such ft 
dividend yield for the past year, prospects for the coming year, and pron 
margin trend—taken from standard financial reports. The information 1n 
each dimension was reduced to 2 values, such as down and up, in order to 
facilitate the analysis of variance, and the brokers’ judgments were made $ 
a nine-category commendation not to buy through neutra 
Most of the 11 factors influenced os 
xpected, but definite interaction enon 
yield, for example, was considered a a 

j ‘hen the profit margin trend was dowi 
while the reverse was true when the profit margin was up. Five of ae 
interaction effects were found in the data of 1 of these expert judges and : 
in the other, demonstrating that substantial configural processing of hi 
information on the Separate dimensions did occur. Most investigators who 
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have studied multidimensional judgments quantitatively have reported, often 
with some surprise, that the judges are seldom as complex as they think 
they are; but unless there is something peculiar about the binary informa- 
tion that was furnished to these stockbrokers, more evidence of nonadditive 
combinations can be expected in further research. 


CORRELATED COMPONENTS 

Since Wishner (1960) demonstrated the importance of correlations within 
lists of adjectives in studies of impression formation, careful investigators 
have tried to combine adjectives that are independent in meaning. But sup- 
Pose the judge is asked to rate the attractivenes of someone described by 
three adjectives, one of which is inconsistent with the others, such as honest, 
gloomy, and deceitful? When Anderson and Jacobson (1965) gave lists of 
this nature to their subjects, with the instruction that each adjective was 
equally important, they found that the averaging model predicted the com- 
Posite judgments fairly well. Then they varied the instructions, telling other 
subjects that those who had described the people did not know them equally 
well and one adjective might not really apply. With these discounting in- 
Structions, the averaging procedure yielded some discrepancies, which could 
be removed by weighting. Thus the results agreed with the hypothesis that 
the subjects identified the inconsistent one of three and gave it less weight 
in their overall judgments. Discounting in this sense is the same as giving 
less weight to information from a source of low credibility, as the judges of 
the Rosenbaum and Levin (1969) experiment did. 
_ Dustin and Baldwin (1966) tried to take care of such relations in the 
put by a model that specified the correlations between the meanings of 
Pairs of adjectives, including those with positive correlations, which can be 
called redundant, and those with negative correlations, which can be called 
consistent. Further evidence for the importance of the relatedness of the 
items was provided by Schmidt (1969). Two sentences containing redundant 
information about a ‘person (Mr. A is intelligent. Mr. A does his work in- 
telligently,) did not contribute as much to the judged attractiveness of 

r. A as two sentences, otherwise equated, containing different information 
Mr. A is intelligent. Mr. A is frank with Mr. Y.). Thus the possibility of 
correlated components indicates another adjustment that should be included, 
cither by weighting or by a correlation term. 

he familiar DE ase effect, to be discussed in the next chapter, is a further 
complication that influences these judgments, like most others (Hicks and 
Campbell, 1965; Wishner, 1960; Wyer and Dermer, 1968), and must be 
Considered in a complete account of combinatory principles. 
here is a certain built-in bias in the above experiments. In order to 

check on additivity, they begin with components that are familiar as inde- 
Pendent items, like aiieties, and test how they are combined. Conceivably, 
ot experiments could proceed in the opposite direction, beginning with a 
Cognitive unity and testing whether independent additive components can 
© isolated. Is the fact that this has not been done an argument against 
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i i ` idere: Nt, iS per 
additivity? The adverb-adjective combination, to be considered next. is P 
= ici i i sive adverbs have little meaning 
haps an exception to this criticism, since intensive adverbs have little me ‘ 
when isolated from adjectives. ORIN te ee 
All in all, this vintage model, constructed on the additivity princi Í i 
; ~ 
i ire ard a weighted s r a weightec 
leaning, as circumstances require, toward a weighted sum « i = a 
average, has performed tolerably well. Weighted down with accessories 
a age, ~~ = 5 a ; Sri ge ie wath’ 
adjust for special conditions, however, it has lost the elegance of k ` 
‘ : i i rs ahez i av mecr 
years. And if more interactions crop up in the vears ahead, it may no long 
be able to make the grade. 


LINGUISTIC STRUCTURES 
. . adiectives ithout 
Experiments on impression formation have combined adjectives Ta Mi 
jectivies. are combined in acvdehuite 
regard to structure; but adverbs and adjectives are combined in a ¢ ue 
i s slightly wicked or very charming. Any write! 
grammatical structure, such as s ightly wicked or very £: ANN 
or speaker would expect that if charming is rated as a good word, | l 
. . . . va Š Tor ng ver) 
very in front of it would make it better; if wicked is a bad word, putti aai 
in front of it would make it worse. Intuitively, abverbs and adjective me 
5 ath X ic re a = is intuition, 
to combine by multiplication rather than addition. To test this rue A 
5 2 y opi ihe F > cha 
Cliff (1959) scaled 15 evaluative adjectives (such as disgusting and fait) 
A P A . A tbs Csuch as slightly 
ing) and all combinations of these with 9 intensive adverbs (such as slig oe 
6 a ` 0rd 
and very), by having students at 3 colleges rate them on the tee ‘sian 
z y et E : ; vielded evalua 
unfavorable dimension. Then analysis of the judgments yielded eva ees 
4 a ‘| accoun 
scale values for the adverbs as w ell as the adjectives, The model acc said 
: ‘ f a combination ce 
for the judgments very well, in that the scale value of a combinatio i 
E <1 a ae è lective by thevscale valut 
be predicted by multiplying the scale value of the adjective by s 
of the adverb. The scale y 
says that extremely 
results for | 


value of extremely, for example, was 1.5, so the ne 
good is one-and-a-half times as good as good, nh 
arge groups of subjects indicated that the judging process did, 
fact, proceed along these lines. 

The multiplicative 
(1966), who found th 
individuals as well 
such as that 


model was checked again by Stilson and wani 
at it reproduced the judgments accurately, for a 
as groups. They reported a few interesting A a 
ambiguous adjective tolerable, the meaning of which interacts 
with the adverbs that modify it. Pibe 

Obviously, this multiplicative type of combination is a property i Sa 
linguistic structure, specifically the adverb-adjective structure. The gaa 
refer to abstract dimensions that describe people and events, but the adverb: 
do not refer to anything in the para 
modify the adjectives, to increase or decrease their effects. Hence it is er 
surprising that judgments of the linguistic combinations do not add or average 
like judements of two adjectives, 

Although sentences are 
components as much as 
tences about men can be 
verbs, and noun objects, 
(1968) constructed many 


. + . os ance is ta 
same way; their function in a sentence 


longer, the components do not depend on the Sa 
an intensive adverb depends on an adjective. ary, 
synthetically constructed by combining adjectives, 
thus: “The vicious man harms beggars.” Gollub 
of these and tried to account for evaluative ratings 
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ri > a aseri ro 
of the men described by such sentences from evaluative ratings of the three 
ce ey ce : ; = 
mponents. First treating the verb and the object together as the predicate 
ars ` rot S 27 ees s as as 
simple model that assumed additive combination of adjective and predicate 
accounted for 98 percent of the variance of x j 
à : cent of the variance of the ratings Ci i i 
A p cene ] ratings. But the contribution 
j predicate was best explained as a multiplicative interaction of verb 
ar jec "er, jecti i 7 
ad object. Overall, the adjective accounted for 37 percent, the verb 37 per- 
ce F NE . . . c A k 
ent, and the verb-object interaction 22 percent of the variance of the ratings 
gs- 
it 


In these sentences, the object of the man’s behavior had little influenc 
was the adjective applied to the man and the verb describing his behavior 
that had the most influence on the multidimensional rating. is 

ý The lesson to be learned from these studies of linguistic structures is clear. 
2valuative judgments of syntactic combinations cannot be predicted from 
evaluative judgments of their components as well as lists of independent 
words, The reader not only adds the effects of the components, he often 


multiplies one by another. 


q HE CORRELATION MODEL 
a! sima model, which has been mentioned above as a measure of 
the R of unidimensional judgments, is particularly appropriate for 
analysis of multidimensional judgments. A clinician, for example, is 
found in a person’s folder 
, and that judgment is 
items. In the 


ar a items of information that might be 
aR ae s” minke 3 summary judgment about him 
eed re e BEESON to be predicted from the separate 
correlated a cae judgment of the quality of solutions to a problem was 
ings. TI we PRES components of quality, s ; 

S hus the statistical model is the same as that used to predict average 
ard procedures 


uch as unusual word mean- 


schoo ne : 
l grade or job success from several test variables. Stand 
n yield a correlation coefficient for each 


nee Correlation and regre i corn i 
ai a ee measuring its linear contribution to the summary judg- 
» and a multiple correlation coefficient, measuring the predictive efh- 
This model does not differentiate 
n computed for each 
1 the possibilities of 


e e Tao additive combination. 

Co poe ery and averaging, but weights have bee: 

AR as in the studies of impression formation, anc 
arity and interaction have been examined. 


= UTILIZATION 
As « . z ¢ ¥ 
$ an illustration of the correlation approach, Hoffman (1960) had two 


ee aa on a 9-point scale, the intelligence of 100 persons from ä 
of Eneli Speirs cues or predictors, such as bighsehyn] rating, effective use 
Piee h, mother’s education, and credit hours o i 
94g fe A The best linear combination of the pre ia variables correlated 
indse ith the judgments of one judge. Hence, the assumption that the 
ie seed ina linear additive fashion seems reasonable because any 
oker a from this assumption would reduce the ca he correlation. For the 
tained, ge, the corresponding correlation was -829. Similar results were ob- 
vhen 2 judges rated 150 persons as to sociability from knowledge of 


attempted, all expressed in 
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the 8 scores each had made on a printed personality test. For the first judge, 
the multiple correlation was .901, for the other .770. Thus the linear additive 
model accounted very well for the overall judgments of 2 judges, not so 
well for 2 others. 

Hoffman went further and asked the judges how they weighted each 
source of information in arriving at the overall judgment. In both analyses, 
the subjective weights reported by one judge agreed quite well with the 
weights computed by the multiple regression procedure, while those re- 
ported by the other judge did not. One judge in the first study, for example, 
said that he had weighted high-school rating, effective use of English and 
study habits strongly and equally; but his judgments were actually correlated 
very closely with high-school rating and negligibly with the other variables. 

There are some technical questions about the computation of a set of 
weights to represent the independent contribution of each component CI Ioff- 
man 1962; Ward, 1962), but it is methodologically more important here to 
consider the relation between weights objectively computed from the judg- 
ments and the judge’s verbal statements about weights. As in concept re- 
search, when verbal statements do not agree with response-based statistics, 
Some investigators assume that the verbal statements are misleading while 
others assume that the statistics are misleading. We must return to this con- 
troversy, but it should be noted here that in the Hoffman experiment, as in 
most others, the comparison is hardly fair, because the objective measures 
were carefully derived from many responses while the verbal statements were 
made only once, with no training. 

One of the abilities claimed for the sensitive, experienced clinician is the 
ability to go beyond the linear additive model, to use nonlinear information 
and to combine information configurally rather than additively. Without 
entering into the contest between clinician and formula, we can raise the 
question whether clinicians and others do actually integrate information in 
these more complex combinations. The conventional multiple correlation 
model can be expanded to include curviline 


(Goldberg, 1968), and analy: 


for the estimation of interact 


arity and configural interaction 
sis of variance has been used (as noted above) 
RO ion effects when the information is categorical. 
tom these reports and others (Hammond and Summers, 1965) 
rsud complexities can be found if one looks for them, but after re- 
moving the main effects the variance they account for is usually small. 


Like adjectiv is A tpg eae 
aana “69 describing persons, however, the information in different 
C at enter into inical į 2 
a clinical judgme 


h cg eae nt may be inconsistent, aN 
when this situation is arran 


sn ay: ged experimentally, the linear multiple correlation 
ess adequate (Hoffman, Slovic, and Rorer, 1968; Slovic, 1966). As 

7 earlier, Anderson and Jacobson (1965) try to take care of such cases 
ži ngerti = she ee But Hoffman argues that evidence of change 
: ponent due to the context of other components 1S 
evidence for a configural—as opposed to linear—utilization of cues. What 
is consistent or inconsistent is an individual matter; hence, models of cogni- 
tive processes will have better descriptive and predictive power, according t° 
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Hoffman, when they include the structure of the data as perceived by each 
judge. 


CUE VALIDITY 
It is one task to describe how the judge utilizes several cues to make one 


judgment, and quite another to analyze the determinants of the accuracy 
of the judgment. The second task is not only an important practical one, 
as in the training of clinical psychologists, but has broader implications for 
judgment in general. Hammond, Hursch, and Todd (1964) have developed 
a lens model for this problem, which represents, in the manner of Egon 
Brunswik, not only the relations between the cues and the judgment, but 
also the relation between these cues and the distal variable to be judged. 
Fig. 8.4, taken from Hammond and Summers (1965), illustrates how the 
judge utilizes the available cues as mediators through which he attempts 
to focus on the distal variable. 

As an example, consider judgments of the intelligence of persons whose 
IQs are known to the investigator but not to the judge. A clinician is 
asked to estimate the 1Qs from Rorschach signs, which are called cues 
because they are used as mediators for the judgment of the distal variable, 
intelligence. The correlation between the judgments and the objectively 
measured IQs, called the achievement correlation, depends on the correlations 
between the judgments and the cues, called cue utilization correlations, 
but also on the correlations between the cues and the objectively measured 
IQs, called the ecological validity of the cues. Obviously, if the dimensions 
that are utilized are not related to the objects of judgment, the judgments 
cannot be accurate. Although the models previously discussed have been 


Achievement (rg) 


S's 


Distal 
estimate 


variable 


Cue utilization (rs) 


Ecological validity (rei) 
Cues (Xi) 


FIGURE 8.4. Brunswik’s lens model. The achievement correlation, ta, is the correlation 

etween the subject's estimate and the variable estimated. The correlation, te, between 
ny cue, X, and the variable estimates is the ecological validity of the cue. The 
Correlation, ts, between any cue, X, an the estimate represents cue 


Utilization. From Hammond and Summers (1965). 
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concerned with the cue utilization side of the lens, the ecological validity of 
the cues is critical to the analysis of the judging process and to training of 
judgment. 

In one study (Grebstein, 1963) of estimations of Wechsler-Bellevuc 
1Qs from Rorschach signs, a correlation of .50 was achieved by naive judges 
(graduate students with one course in the Rorschach), a correlation of 65 
by semisophisticated judges (second-year VA trainees) and .68 by sophisti- 
cated judges (with the Ph.D and five years of experience). The cue utili- 
zation correlations were about the same—.90 to .95—for the three groups, 
which means that they were all making linear use of the cues. Since 
the multiple correlation on the ecological side was only .79, the semiso- 
phisticated and the sophisticated judges were almost as accurate as they 
could be. The errors in the beginners’ Judgments were due to a mismatch 
between cue validity and cue utilization. For example, one Rorschach cue, 
said to indicate intellectual energy, had an ecological validity correlation of 
-18; but the naive judges’ data showed a cue utilization correlation of .63, 
as compared with .38 and .27 for the more sophisticated. Evidently, the 
naive clinicians were relying on a textbook rule, while the sophisticated 
clinicians were depending on a rule developed from their experience. In 
general, the sophisticated clinicians must have been using a set of weights 
for the different cues that were about the same as the cue validities, because 


they were getting practically all the predictability they could out of the 
system. 

Training for such abstract 
of the Separate dimensions, 
whether appropriate w 
from practice with in 
experience. A partial 


judgments must include conceptualization 
but the evidence above raises the questigh 
eighting of the dimensions could be learned directly 
mediate feedback rather than only from prolonged 
answer to this comes from a study of judgment an a 
quasi-clinical situation by Newton (1965). He gave his judges four kinds 
of information about college students—such as test scores and personality 
ratings by high-school principals—and asked them to predict average grades 
during the first term in college. Then he calculated the statistics described 
above from the lens model and brought the Judges back for more judgments 
with various kinds of feedback. Judges given helr achievement and utili- 
zation correlations from the first series of ‘felons did not improve. Those 
given their achievement, utilization, and validity correlations did show some 
improvement in their Predictions, from .58 to 65. The gains, according tO 
the above analysis, were due to better match between ihe cue utilization 
and cue validity correlations, 

This outline of the lens model is oversimplified, but it illustrates one of 
the few approaches to take into account the accuracy of multidimensional 
judgments. It is a normative model in the sense that it recommends match- 


ing the utilization to the validity correlations. Since objective measurement 
of the distal dimension is necessary 


cue validity correlations, this model 
situations such as estimations of in 


z ; he 

for computing the achievement and dl l 
acia 

has been applied only to a few spec! : 
tellizence and grades, but it teaches 4 
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lesson of wider signific: ji i 
ee wider significance. Without much difficulty, students can adjust 
cir judgments to correspond with those of peers and teachers; it is en- 
heit fi sy ‘ 
couraging to know that under some conditions they can also bring their 
7 ath) . . . Š 3 
judgments into closer correspondence with the objective situation. 


LEARNING TO CORRELATE JUDGMENTS WITH MULTIPLE CUES 
Some of the questions raised above have been investigated in more detail 
by laboratory experiments with artificial materials; these require the subject 
R e eee ee 
g a a acy. a a cue and 
the criterion response to be learned can be varied from zero to unity and 
can be lincar or nonlinear. Thus these are experiments about learning but 
they are about learning of a complex, cognitive type. The conditions that 
are manipulated by the experimenter are not the ones manipulated in con- 
ventional learning experiments, but correlations among cues and between 
cues and feedback. The correct combination of cues as defined by the 
experimenter governs the informational feedback given to the subject, 
f judgment is measured by the increase 


ar > j ss : 
ad the improvement in accuracy 0 
and the experimenter’s 


in the correlation between the subject's judgments 
definition of correct judgments. ; ‘ 
One of the earliest experiments o 
nipulated the correlation between the 
ale a different validity, and came to the conclusion t 
utilize many probabilistic cues simultaneously. 


me è i : 
nt to illustrate the procedure is one by Summers (1962), w 
dimensions (position, color, and 


indicated his judgment 
the correct line length 


f this type, by Smedslund (1955), ma- 
cues and the correct response, giving 
hat the subjects could 
A convenient experi- 
ho constructed 


2 ‘ 
ae piojeciing triangles that varied in 3 a 
graphi -all color), 8 values of each. The subject 
See acne baie be allowed to a harreta es 
he e a r eee ee was renden on all 
3. Letting Y repr > 3 soa 7 f jud b } armed and X the cues. 
kanie present the criterion of judgment to be tear ; : 
ie relation to be learned in this experiment was: y = 2X, + 1.5X2 + Xz. 
therwise expressed, the correlation between criterion and cue for X, was 
7 for Xo Sésand for Xa 3% TO eliminate effects specific to the cues, 
these values were rotated among the 3 cues; but for any condition, the subject 
Was provided all the information necessary to make a correct judgment. 

When these slides were shown to 30 ninth-grade students, the results 
demonstrated that they did make use of all 3 cues. Their overall accuracy—as 
Measured by the correlation between students’ estimates and correct line 
engths in blocks of 64 judgments—increased with practice. More than that, 
cy learned to use the 3 cues differentially. As practice continued, they 
farned to put the most weight on the most important cue, that is, the cue 
with the highest correlation with the correct line length, and the least 


Weig $ 
rent on the least important cue. 

ee Mprovement in using nonlinear in 
Deri Ni ET 
Perimental construction of a nonlinear 


formation can be investigated by the 


dimension. Hammond and Summers 
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m ` test 

(1965) simplified the judging task by printing numbers, a n iii 
scores, on one side of a series of cards and the criterion n = a yan 
these numbers on the reverse side. Half the variance of the en a ga 
determined by one set of numbers, X,, and half by the other pre ee 
was related to the criterion in a linear manner, while the oa = K 
form of a sine curve, and the criterion combined the two { a l mor Ci 
The subjects practiced estimating the criterion and got — pea ica 
tion by turning over the card. In general, they improvec voces et 
judging the criterion, but the improvement was dependent a is ~ Seat 
One group of subjects, told simply to make inferences on the ba a rion 
scores, made little use of the nonlinear cue; another group, told 5 ae 
linear and nonlinear relations were involved, did somewhat Daae a kada 
third group, told which was the linear and which the nonlinear cue, 
to make good use of the nonlinear information. ioscan st 8 

These findings were verified and extended in another g nda poe 
type by Summers and Hammond (1966), in which the C natai 
vertical lines of varying lengths on a card and the criterion to a = 
was printed on the reverse side. When the instructions identihed t NE 
and nonlinear cues, the subjects soon learned to make os “ners 
both and attained achievement correlations in the .80’s and .90's. $ T Tada 
and Hammond also called attention to the resemblance between p 2 
ment tasks and concept attainment tasks. Asking the subject ae Tae dye 
criterion from given cues is analogous to the concept task that i = ~ fap ts 
relevant attributes and asks the subject to find the rule relating aie 
class defined by the experimenter. In the other conditions, the on er not 
supplies more information about the rule or rules. The analogy oo experi- 
be pushed too far, however, because the rules considered in ger sane 
ments with continuous dimensions are not quite like those of combine 
discrete attributes. septs and 

Chapter 2 noted the similarities and differences between class conc aed by 
dimensional concepts. The attainment of a class concept is demons ae other 
the correct use of two responses, one indicating class inclusion and the yor 
exclusion, for a collection of stimulus objects. When the responses en 
on a scale of eight to ten intervals, the activity is usually called i 1 cone 
rating, or scaling, and consistency implies attainment of a ee be 
cept even though, as in the case of class concepts, the concept may ‘onset 
explicitly defined. An experiment by Azuma and Cronbach (1966) panes 
gated the intermediate case, between the class concept and the — “Ther 
concept, by requiring one of four responses, A, B, C, D, for each rk Med 
described the task as the attainment of a scalar concept. Each Spa e which 
sisted of a square frame containing a circle and a cross, the positions o e 
varied horizontally and vertically, Thus there were four dimensions i vant; 
in the stimulus situation, two of which the experimenters defined as irre : ae 
the criterion to be learned was a weighted average of the two relevant “ema 
sions, thus: Y = (2X, 4 X»)/3. The task was to combine the two _ oe 
dimensions, weighting one twice as much as the other, and use the 
interval scale consistently for the composite. 
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: Information about right and wrong responses was given, and most sub- 
jects improved Cas shown by increases over several blocks of trials) in the 
mean correlation between responses given and correct responses. This mean 
correlation eventually reached .90. More analytically, the correlations between 
responses and relevant cues increased although the correlations with ir- 
relevant cues did not, and the correlations with X, were larger than the cor- 
relations with X}. But these are all mean correlations. The experimenters were 
impressed by the differences between subjects in judging tendencies. One 
subject, for example, did not improve at all in his matches to the weightings 
defined by the experimenter, nor was he consistent with himself from block to 
block in the way he weighted the stimulus dimensions. From the comments 
of their subjects, which many other experimenters do not trust, Azuma and 
Cronbach concluded that the overall achievement correlations may be mis- 
leading, As in experiments with complex classes, the subjects can divide the 
Problem into subproblems to be solved in turn, although the statistics measure 
of these criticisms of the correlation method 


only overall performance. Some 
of analysis may be peculiar to the four-interval scale, which can be treated by 
the subjects as four categories; but the variations between individuals can 
be observed in all experiments. : 

College students can learn to make judgments on the basis of rather com- 
Plex interactions between cues when the patterns to be judged are slides 
designed to simulate highly magnified blood cells (Summers, Summers, and 

arkau, 1969), Cue 1 was based on the amount of crushed dark glass scattered 
across the backeround; Cue 2 was the size of the rough gray disks that 
aPpeared in each slide. Both of these relevant cues had eight possible values, 
and irrelevant variations also appeared in the slides. The subjects were asked 
to estimate the age of the cells in each slide and were then given the correct 
information. In ane condition, the criterion was a simple additive function 
er it was a multiplicative function. In a third con- 


of 
the two cues; in anoth Jue: of Cue: 2 and i 
va ; and in 


oe the value of Cue 1 was the exponent of the 
te fourth, Cue 2 was the exponent of Cue 1. 


Over 2 days of practice about 3 hours altogether, definite improvement 


occurred, reaching achievement correlations of .60 or above for each condition, 
me of this improvement could have been due to the use of the linear 
Component in the stimulus patterns, but the analysis demonstrated that there 
was an improvement in the use of the nonlinear information also. Although 
ere was an initial tendency for all subjects to rely ona ie da a they 
amed, over 640 judgments, to use the relationship to which they were 


EX Posed Th ific nature of the perceptual patterns must have been 

i 3 e specific ‘ f ; i $ 

portant facet Š exponential function was considerably more difficult 
ause one ©) 


an the other. 


CORR s OF GAMBLING . A 5.8 
Le penta ae be psychometrics of gambling. This decision 
situa S return briefly to “sith the aid of the expected value model, 


ti ed wit = a, ff is 
On has usually been analyz or a loss, by its subjective probability, 


nich . 
doned this model and switched to cor- 


aes Weights the value of a win, 
vic and Lichtenstein (1968) aban 
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relation analysis to describe how the dimensions of a gamble are weighted 
and combined. They offered their subjects duplex gambles and informed them 
of the probabilities of wins and losses. One gamble, for example, offered a .8 
probability of winning $2 and a .8 probability of losing $1. Another o fered 
a .2 probability of winning $I and a .4 probability of losing $2. The subjects 
were asked to judge the attractiveness of each duplex gamble and bid an ap- 
propriate amount of money for it. The bids for some gambles wou d. of 


course, be negative amounts; the subject would expect to be paid to play 
them. 

The expected value model assumes that values and probabilities are com- 
bined multiplicatively, but Slovic and Lichtenstein used a simple corre ation 
model, which assumes that the judgment is linearly related to the four 
dimensions of a gamble and that these dimensions are combined additively: 
In one experiment, the predictions from this model correlated .86 with the 
actual bids. But the most striking outcome was the extent of the individual dif- 
ferences in the weights attached to the four dimensions. The correlations 
between risk dimensions and bids for one subject were: for the probability of 
winning .81; for the value of the win .13; for the probability of losing —.27; 
and for the value of the loss —.30. The corresponding correlations for another 
subject were: .19, .10, —.23. and — 9]. Although the experiment did not 
compare this model with others, the data show once again that a linear 
additive model can account for a large share of the aranca of the judg 
ments if different weights are fitted to the data. More important psychologi- 
cally, the data suggest that the subjects did not regard probabilities merely 


as multipliers of values, as illustrated in the expression pV, or as adverbs 
) give 


intensifying adjectives, as illustrated by very charming. They seemed t 
the probability of a win, or a loss, some weight of ie cnt as in some of the 
experiments mentioned in the earlier discussion of decision models. And, as 
illustrated by the correlations listed above, the importance attached to a loss 
is not necessarily the mathematical opposite of that attached to a win. 


JUDGMENT MODELS ; D JUDGMENT PROCESSES 
This chapter, like the others, is concerned with man’s intellectual processes 
and relies on analysis of objective data more than the subject's statements: 


Seve: matics Š P e . se >en 
Several mathematical models for the integration of information have beer 
discussed because each embodies a hypothesis about judging processes an 
sugg ; akel But 


ests objective measures to be computed as tests of the hypothesis. 
model building and testing is only one phase of the search for psychologica 
understanding. The appeal of an elegant model capable of embracing moe 
the variance of judgments should not distract us from our main concern 
with judgment processes. Attempts to fit a linear model to judgment data 
psychologically enlightening because they assume that the judge simply adds 
his judgment of one constituent dimension to his judgment of another. The 
experiments reviewed here indicate that, with a few interesting exceptions: 
the linear model can account for most of the data. When analysis of variance 
is carried out, the main effects are large and interaction effects typically small. 


are 


a 
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When correlations are computed, the linear multiple correlation is usually 
lar 


This is the evidence for the common assertion that judges are simpler 
than thev think thev are. 

Although it is customary to speak of predicting the summary judgment or 
Criterion from the separate dimensions, these are not predictions in a temporal 
sense. After the data are collected, weights are calculated, by one method or 
another, for each dimension. This is a mathematical trick to make the model 
work better, but the obtained weights can give us valuable psychological 
information about the importance of each dimension to the judge. The in- 
formation is dependable, however, only if the model is appropriate. Calcula- 
tion of a few weighting parameters ex post facto, like a few adjustments in a 
ready-made suit, can give an inappropriate model the appearance of good fit 
and thus vield spurious information about what the judge considers important. 

Furthermore, changes in weights are often open to more than one interpreta- 
tion. As noted above. Anderson (1965a) was able to account for the primacy 
effect in sion formation by serial adjustment of the weights attached 
to (1969) has offered a change-of-meaning 
ssumes that the meaning of a trait 


impre: 
the adjectives. But Chalmers 
model to account for such effects. He S or 
adjective, as it applies to the person described by a sequence of adjectives, 
“depe or facilitation undergone by the trait 


nds on the degree of interference 2 
âs a result of the antecedent a in the sequence [p. 453].” Although the 
Processes assumed are different, these alternative formulations are mathe- 
Matically similar and lead to the same predictions. 2 l i ) 

‘he gencral difficulty with this approach is that predictions from alternative 
Mathematical formulations, like predictions from alternative verbal formula- 
tions > E ly when the hypothesized functional 
relations are monotonic, the range restricted, the data fallible, and when 
dividuals are grouped "Wiggins and Hoffman (1968), after a careful study 
that Separated ene = $ £ care ee (summarized below), pointed out that the 

. ypes of judg 


4 5 we ge. “The judg- 
Nees between linear and configural judges w ere not large judg 
configural clinicians can often be 


del [p. 77].” And this statement 


» are not easily distinguished, especial 


differe 
Ments 


ts of even the most seemingly 
estim 


eml a Ta a precision i a linear mo 
lally we > reversed. . 
ee he E by ya us in other contexts that the ei T a 
pothesis docs not prove another. Thus, under the usual con ee judg- 
mot experiments, the negative statement that deviations from Rin ic 
me tit significant is not equivalent to the positive statement that rhe linear 
mode] fairly represents what the judges are doing. As Creen KIRGA) basrpuk 
Ty “the perverse EHS of linearity” is due largely to methods of 
analysis that basi as the assumption of a linear function, compute the 
Variance that a be attributed to linearity, and find small residuals to be 
Attributed to nonlinearity It may turn out by this method that ey peinent 
of the variance is att rable to linearity and 10 percent of nonlinearity; 
abale Facia. or some other, might E a a 
urvilinear terms. One is more likely to find evidence for conf gural 
Judgment if, instead of fishing for nonlinear residuals, one begins with an 
a: s? d 


ereas a 
Vari P 
, “Tance in c 
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explicit hypothesis about a judgment process that involves curvilinear use of 
the information or interactions between components and tests this hy pothesis 
directly. The studies of adverbs as multipliers supply a good illustration of 
this more explicit procedure, and that by Kleinmuntz (1968) takes a step in 
this direction by describing configural decision rules in advance. The more 
recent research and reinterpretations of older research suggest that judges 
are not as simple as the early judgment models were. 

Mathematical models of judgment tend to be applied to judgment situa 
tions that are easily quantified, as when the information on several dimensions 
is homogeneous or can be homogenized by conversation to standard units. 
But in many situations, the information to be integrated is quite diverse and 
the possibility of summation seems remote. To take one illustration, when 
you apply for a loan, your friendly banker will collect many diverse bits of 
information from you, usually by filling in a standard information blank. 
Aside from the monetary information on assets and liabilities, his treatment 
of the data is likely to be a serial one: he inspects each item as a possible 
reason for rejection. Serial procedures must be more frequent than current 
research indicates. In many complex situations, the data are not integrated by 
one multidimensional judgment but the decision is made by serial elimination 
of alternatives. A search for one general combinatory rule will be as dis- 
appointing as the search for one psychophysical law or one learning curve: 
Individual differences in combinatory strategies are as enlightening as general 
principles. 


JUDGING PERSONS ON SEVERAL TRAITS 

When someone has multidimensional information to communicate, he 
(depending on the conventions of the situation and his own cognitive style 
write a descriptive paragraph in the form of connected discourse, Of go to 
the other extreme and wrap it all up in an either-or judgment. Between these 
extremes, he may summarize the information in one rating on a continuous 
scale, as just discussed above; or he may preserve the multidimensional infor 
mation by making separate ratings on separate scales. The output as well a5 
the input may be multidimensional. 

Analysis of this judgment situation has been stimulated by the ques 
whether the dimensions of information are really kept separate. The vener ie 
hypothesis of a halo effect Çi.e., that people rated high on one desirable 
trait are rated high on other desirable traits) is equivalent to the assertio” 
that the output is less multidimensional than the input. The experiment 


i 3 ; > is 
reviewed above ask how the judge combines the dimensions when he ie 
required to make a summ is whether ¢ 
judge does so w 


may 


tion 
2 ble 


ary judgment; now the question 
hen he is not required to. 


AN. 
ge OF THE VARIANCE IN MULTITRAIT RATINGS sed oF 
: early research that led to the hypothesis of a halo effect ee als 
correlation coefficients. When a foreman, for example, rates the individ 


in hi 5 : E . a 
1S crew on several traits, the correlations between trait ratings * 
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often quite high—as if the separate ratings were dominated by one global 
impression of overall effectiveness. Actually, correlation analysis is dumber 
some, because of the number of correlations required, hence Guilford 
(1954) adapted the analysis of variance to trait ratings. The main effects— 
due to raters, traits, and individuals—are interesting; but the special advantage 
of this analysis comes from the interaction effects, which are psychologically 
meaningful and often quite large. Guilford distinguished two components 
of the halo effect: the main effect common to all raters, which is the more 
general component; and that varying between raters, a relative halo effect 
which is computed as an interaction between raters and individuals being 
rated. But another interaction, that between individuals and traits, is 
inversely related to the older measure of halo, the intertrait correlation, and 
can be considered a measure of differential discrimination since it measures 
the raters’ agreement in assigning certain traits to certain individuals 
Willingham and Jones, 1958). 

An application of this modern analysis to the old question of the halo 
efect CJohnson, 1963; Johnson and Vidulich, 1956) began with ratings by 
college students of five individuals—Qucen Elizabeth II, Senator Joseph 
McCarthy, Sir Winston Churchill, Mrs. Eleanor Roosevelt, and Pope Pius 
XT—all in the public eye at the time, on five traits. The general halo effect 
4ppeared clearly. Pope Pius, for example, was generally rated high and 
Senator McCarthy low. To put this effect in terms of the old method of 
Measuring halo, it was estimated (Stanley, 1961) as equivalent to a mean 
Mterjudge correlation of about 40. The relative halo effect, the interaction 
etween was also significant. But the largest 


raters and individuals, 
which means that the 


interact i 9 NPE : 
3 S was that between individuals and traits, 
Judges minating judgments. They rated Sir 


did make analytical, discrir 
Senator Joseph McCarthy very 


iy nn, Churchill high on intelligence, J 
v on kindliness, and Queen Elizabeth high on pe sonal appearance. 

te between correlations which develop 
which are in the information that the 
s have received. Hence the experiment tried to separate these by 
Manipulation of the judging procedure. To minimize halo, one group judged 
~ Bve as OHE a day for five days, while another group, 
: idual a day on all five traits. 


his analysis does not differentia 
during the 


judging process and those v 
Judge: or 


ied diale oi one trait a 
ted to maximize halo, judged one indiv ae Hoek | 
tide he expected differences did not appear- jo there a no clear-cut 
Sites. for halo effect, when that term means a bias introduced by judging 
oe ions, The general halo effect could be due to mass communication, 
h brings standard information about the individuals to all judges, and 


€ relative halo effect could be due to selective communication. 
s is that the information received 


y “a basic difficulty with these analyse: o 
Y the judges is not controlled, as it is in the impression formation studies. 
res or, in the industrial situation, 


ej * , 
Judges are given the names of public figu à 
Employees and asked to sort out their impressions of these people on 


Seve : : ; 
tal dimensions. Or, if the judges ate asked to supply names of relatives 


and E 
acquaintances whom they can rate, the trait ratings are generally 
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correlated (Koltuv, 1962). But, in any case, the correlations within the 
information available to the judges, which have been experimentally manipu- 
lated in some impression formation studies, are unknown; hence the correla- 
tions between the dimensions of judgment cannot be clearly interpreted. 
Ideally, the output correlations would be isomorphic with the input correla- 
tions; to determine how the actual judgments deviate from this ideal would 
require a more complex analysis. 


STEREOTYPES 

Stereotypes and halo effects are similar in that they are intended i 
account for oversimplified judgments of persons, but the halo effect i 
supposed to develop in the judging process as a failure of analysis, nee 
stereotypes are described as previously acquired beliefs about types © 
people that influence present judgments. Stereotypes had a bad reputation 
for many years because of their role, both as cause and effect, in prejudice 
against minority groups. Recent research has rehabilitated this old concept be- 
cause, in competition with other effects, it is the stereotypes, especially i 
occupational groups, that make the highest contribution to accuracy i 
judgment and the teaching of stereotypes that offers the most promise or 
the training of judges (Smith, 1966). ie 

The importance of stereotypes here is that they illustrate another ae 
tion from the additivity assumption. If a woman is characterized by aa 
adjectives, the judge may add these somehow; but if she is first introducec 7 
an ophthalmologist, or as an exotic dancer, a cla ing or stereotyping apers 
tion will precede the adding. 


Some Characteristics of Judges 


When the abilities that distinguish man at his best are under gaion 
we often hear the phrase “good judgment.” This is commonly cited am 
the requirements for success, whether as an executive or motorist, quarter aia 
or parent. And good judgment is indecd complex; the research reviewes 


z 3 i f » abilities 
this chapter, together with previous analyses, calls attention to the ab 
involved in: 


abstracting the relevant facts from a complex situation, 
adopting and maintaining a set for multiple relevant facts, 
weighting each appropriately, 

integrating all this information to fit one of the response alte 
rather than the others, and 


communicating the judgment with due caution, 


AYN 


natives 
5, 


ain 
. . , ives conta! 
Judgment tests devised to differentiate good and poor executives - “ee 
i > -nate 
a large measure of general intelligence, and such tests do discrimina 


ang 
a n „m solving 
tween those who are generally good or generally poor in problem 
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CHarootunian and Tate, 1960). Judgment tests used in the selection of 
personnel often include questions on information specific to the job also, 
because the good judge in some situations is the one who knows the relevant 
facts. In addition, the good judge knows which facts are important and how 
they should be organized. 
, Many differences between judges have already been mentioned in passing. 
Substantial differences in preferences and in accuracy appear in the data 
on unidimensional judgments; in multidimensional judgments, differences 
in the weights attached to the dimensions have been computed, and some 
models distinguish judges by their combinatory operations. For example, when 
28 clinicians—psychologists with the Ph.D. degree and clinical trainees—made 
Summary judgments of persons from their scores on a personality inventory, 
Wiggins and Hoffman (1968) were able to categorize them according to the 
Judgment models that matched their judgments. The judges were given the 
Personality test scores of a large number of individuals and asked to rate 
each on an ll-point scale extending from neurotic to neutral to psychotic. 
The Judgments indicated that 12 of the judges were best described by a 
mear model, 3 by a nonlinear model (more precisely, a quadratic model), 
and 13 by a sign model. The signs in this case were certain diagnostic indica- 
tors acquired from empirical ‘clinical literature or from clinical folklore, 
Such as a score on a single scale or a linear or configural combination of 
Scores on several scales. Thus 12 of the judges were considered linear and 
16 Configural, 
ing ass: Hoffman, and Taber (1969) epee up a ae BP ey 
intelliow subjects Cheterogeneous in age en e ote exits aiie a ha 
tive ton a ae : geome nee ete information on 199 
individuale English, sppe n peT — analysis divided the 
Subjects als applying for admission to co i. = iher ie dais ati 
of a Dto 8 groups, which differed in the way they ma S SK 
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ation on study habits and responsibility, ical iii 
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es als ook a battery hili 
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APPear ‘ 
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Aabi č 8 e z k K 

abits were the oe the cot the highest scores for authoritarianism and 
g £ 


telio; 5 
wha a conservatism. Thus there seems 
at j = y , à 

Judges consider important in others 


to be some connection between 
and their own personalities; but 
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in most other such studies, relationships between judges’ personalities and 
their judgments have been weak. 

An analysis of the variance in multitrait ratings reveals interactions be- 
tween raters and individuals, which probably reflect differences in informa- 
tion or in the acceptance of it, but also between raters and traits, which 
more probably arise in the individual’s judging processes. Furthermore, some 
judges are more inclined than others to stereotyped thinking, and even 
unprejudiced thinkers seem to have implicit theories which lead them to 
make judgments as if certain dimensions of personality were assumed by some 
judges to be correlated and other dimensions by other judges. 

Another characteristic that could produce pervasive individual differences 
in multidimensional judgments is a judge’s cognitive complexity or dimension” 
ality. How many dimensions does he use? One approach to this has the judge 
list the dimensions that he proposes to use in judging persons; a second 
provides the dimensions for him. Cognitive complexity is studied by com- 
paring the judge’s ratings of a person on one dimension with his ratings 
of this person on other dimensions. The complex judge makes more 
differentiated use of the dimensions; the simpler judge makes the same ratings 
on different dimensions, or, in effect, uses fewer dimensions (Bieri et al., 
1966). This differentiation or complexity in a judge’s ratings can be me? 
sured by a count of differences between ratings on the different dimensions 


by factor analysis of the correlations between dimensions, or by analys 
of variance. 


In different experiments, the judgments of complex judges have see 
in many ways from those of judges with simpler cognitive structures, ra 
the difference most pertinent here is that the complex judges handle eee 
tent information more adequately (Bieri, et al., 1966), The simpler jape 
blur the distinctions between dimensions—as if they were more susceptible 
to halo effects—or fluctuate with the most recent information, while fe 
complex judges preserve the distinctions and try to make sense out gial or 
data. In this respect older children are presumably more complex than yawe 
children. The discussion of the development of conservation in a 
noted that a judgment of quantity can be treated as an integration of ps 
ments of height and breadth, with older children using both dimen 
while younger children discount one as inconsistent with the other. 
experiments on judgment have thus far seldom taken advantage © 
developmental approach. 

ese various characteristics of judges have, with few excep 
computed only once and only for judgments of persons. The me 
developed as a by-product of experimental and social psychology $° ihe hese 
usual psychometric questions about the stability and generality of t 


. i ie A i a 
erect differences have not yet been answered. On the other sia cho 
inte lectual abilities of symbolic and semantic judgment identified by 4 Pod 

metrically sophisticated program of factor analysis (Guilford, 1967), °° as 
interesting by the 


G . ces 
mselves, have not yet been related to the judgment Pr? 
discussed in this chapter. 
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Major Trends and Implications 


Judgment begins, from the psychological point of view, with a disorderly 
situation, confused bv alternatives, that interrupts the individual’s course of 
action; it ends with a more orderly one, that permits him to move on with 
less uncertainty. The end in view is an ordering of the alternative responses 
to correspond with some physical dimension of perceived stimulus objects, or 
with some abstract cultural dimension of words and other symbols, or with 
the attractiveness of the objects at hand and of expected consequences of 
alternative actions. It is experimentally convenient to study judgment by 
itself, isolated from other intellectual operations such as productive thinking, 
but everyone knows that judgments are involved at critical points through- 
out a complicated problem-solving project, as in the initial choice of a 
Strategy and the final choice of one of the alternative solutions. The decision 
to begin is itself a risky decision because, as John Dewey put it, “If we once 
Start thinking no one can guarantee where we shall come out.” 

One test of unidimensional judgments is consistency within the judgments 
of one individual, which, in the case of preferences or affective judgments, is a 
Measure of that elusive desideratum called rationality. Another is agree- 
ment between independent judges, which depends on a common conceptuali- 
ation of the stimulus dimension and can be considerably improved by 
pPropriate training. When an external criterion is available, objective mea- 
SUres of accuracy can be computed, which indicate that ge a 
1 a: © : š cei è i he amoun 
sa a e iaaa kear nk, ie, Booed 
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than interactions, and in multitrait ratings, correlations between traits rather 
than sharp differentiations. There is a tendency to put the objects of judg- 
ment into classes or, if they are people, into stereotypes, a move that may be 
effective or not, depending on the accuracy of the stereotypes. In either 
case, it simplifies the structure of the input information. The ultimate simpli- 
fication is attained, of course, when people or ideas are divided into two 
categories—the good and the bad, or those on our side and those on the 
other side—and the information received is adjusted to permit two-category 
judgments of all instances. At the other extreme, high fidelity to the struc- 
ture of the data could best be attained by graphic or mathematical representa- 
tion of complex relations—usually not allowed in judgment experiments. 

The linear additive model, in one form or another, ‘has been a useful first 
step in describing how the judge combines multidimensional information 
into a summary judgment. But it has been necessary to add weights, to com- 
promise between adding and averaging, and to recognize the occasional 
occurrence of configural combinations, In linguistic structures adjectives are 
multiplied by certain adverbs, and in risky decisions outcome values are 
often multiplied by probabilities. The environment contains a diversity of 
objects to be evaluated, some independent, some correlated, and the various 
models of judgment have each staked out a little territory within which 
they perform moderately well. Today the linear additive model is one among 
many. 

The contribution of learning to judgment parallels the contribution of 
learning to the solution of other types of problems. Dimensions, values; 
probabilities, and such cognitive structures as stereotypes and implicit 
personality theories are learned. And, since we know so little about the accu- 
racv of most judgments, we cannot offhand deny the claim that the 
accuracy of most judgments depends as much on previous learning as on the 
details of the judging operations described in this chapter. Cognitive learn- 
mg experiments have demonstrated also that the intelligent adult can eee 
to make configural judgments with suitable instructions and information 
about errors. It is not surprising that a man who can shift back and forth, as 
the terrain requires, from activities as different as swimming and climbing 
can shift also from additive to multiplicative integration of information: 
Sass nee oe judgments as their peers and teachers do ae 
ecological validity ba a sorti bur Gyin a dew oe me a pimeni 
ora supervigin's eneon ee N aie’ uni ities ap- 
E cn res a judge may also practice an intui ele 

8 himself in the raw data and trying to make emp 


lobal j l 
g e judgments, but Progress in this type of training is not usually SY 
os to quantitative analysis. it 

re i ical 

that snk psychological research, like revolutions, often betray the SP a 

c d sj gule 
ae em going. What happened to the Gestalt emphasis on config ing 
o edn a negative sense, Gestalt claims about the structure of the combi” a 

perations have been accepted because no one now defends a simple eet 
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del. The Positive side of the Gestalt contribution was not $0 exp! 
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configuration is a general notion about multidimensional judgment rather 
than a testable hypothesis. The explicit hypotheses have been stated in mathe- 
matical terms, such as nonlinearity, multiplication, and interaction, which 
in some intuitive sense seem to be more structured than additive combinations. 
The ideal that the structure of the judgments should mirror the structure 
of the data has a Gestalt quality, but the tests will come from mathematical 
analysis, 

The overlap between the judgment experiment and the concept experiment, 
Which ties the last chapters of this book to the first, is manifested, not only 
in the language and the explanatory principles but also in the research 
trends. Both have shifted from concrete to more abstract materials, from 
synthetic to more realistic ecological objects of thought, from regularities to 
Probabilities, from simple to complex combinations, from single to multiple 
measures of achievement, and from an exclusive concern with general laws to 
à concomitant concern with individual differences. 
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event to a certain response category. Since t 
of the response is communicative, it may be ™ 
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a word, or gesturing. In any case the judgment mean 
pressed by one of a small number of responses, = f Jaw: 
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for example, the jury is instructed at the outset what categories to use in re- 
porting the verdict. The choices open to players in most games are stated in 
the rule book. In the psy 
what responses he can make and what they mean. Thus when the judge makes 


-hology laboratory, the experimenter tells the subject 


a judgment, he is set, not only for certain stimulus dimensions, as noted in 
the previous chapter, but also for certain responses. 

Judgments are collected in many psychological experiments, but the 
judging conditions are standardized so that some other function, such as 
Vision, is emphasized. Judgment experiments, on the other hand, manipulate 
ze the judgment function, with the aim of discovering 


conditions that emphas 
general principles that apply to all objects of judgment. While the previous 
chapter focused on the stimulus side of the judgment, the input, this chapter 
will focus on the response side, the output. 


Scales of Judgment 


Judgments are frequently passed on from one person to another in terms 
of standard scales (such as weight in ounces and length in inches) that are 
learned in school and used in ordinary communication according to standard 
usage. The ability to look at an object and estimate its length in inches in- 
Volves achievement of a dimensional concept, length, and a unit, the inch. 
Uniformity of usage of these standard scales develops through comparing 
judgments with sealing of standard measuring instruments, such as the 
footrule. Whatever the mode of acquisition may be, these are important 
Cognitive achievements by which the individual knows the environment and 
Communicates with others about it. 

Contrasted with these are temporary rating scales, i 
long, and 1, 2, 3, 4, 5, that are acquired in specific situations, have communica- 
tive significance only in these situations, and are quickly forgotten. Temporary 
Scales are used in many practical situations where standard scales do not 
apply; they appear in judgment research more often than standard scales 

ecause their development can be experimentally manipulated. 

here is an interesting analogy between the judgment experiment and 
the Concept experiment. A a conventional concept experiment, when subjects 
achieve the class of red squares, they are not learning = — is and 
Square, h y attaining a temporary conjunction of two familiar con- 
cepts, ai alen esibeck of a judgment experiment rate squares 
as small, titedium, and large, they are not learning the concept length nor 
the Meaning of the three adjectives; they are working out a temporary adjust- 
Ment of the three-category scale to the objects in view. 


such as short, medium, 


SCALES AS COGNITIVE STRUCTURES 

Considerable progress in judgment research has been made by study of 
€ association between responses, such as short, medium, and long, or 1, 2, 
> 4, and 5, and objects of various lengths, just as considerable progress jn 


386 =|=! JUDGMENT: RESPONSE VARIABLES AND SCALES 


concept research has been made by studying the association between same 
mon responses and dissimilar objects. Most of the research summarized in 
this chapter follows this approach. But there are conditions for which such a 
direct approach is not sufficient. Guilford (1954) has pointed out how classical 
psychophysics assumed a direct linear relation between responses to sensory 
stimuli and overt judgment and that, while this assumption is often justified 
there are exceptions which necessitate discussion of the “judgment contin- 
uum” as separate from the immediate response to stimuli. 

Furthermore, the direct approach misses another important kind of be- 
havior. Sophisticated people talk about scales as they talk about concepts, and 
this kind of behavior is as important for a theoretical understanding of 
judgment as any other. It would not be unrealistic to ask a college student: 
On what scale is 5 degrees a little above freezing? What is the meaning of 
a score of 90, or 125, on an IQ scale? Just as one who has thoroughly learned 
to classify objects can also describe the class, one who has accurately judged 
objects on a scale can also describe the scale. One who can judge the lengths 
of lines in inches can also draw lines to specifications in inches. College 
students can describe scales by an adjustment method as by locating the 
boundaries between scale categories. Investigations of the context effect, to be 
mentioned later, show that scales organized from judgment of objects m- 
fluence production of objects, and production of objects influence judgment 
of objects. Such behavior can best be understood by treating the scale, like 
the concept, as a cognitive structure, manifested under different conditions 
by different behavioral achievements. The reason for calling the scale a 
structure, of course, is that the categories are ordered. edf 

Suppose, by w y of illustration, that someone is asked to judge a series ° 
lines as short, medium, and long, After he has judged all lines two or three 
times, the three-category scale will be adjusted to the series of lines, a 
shown in Fig. 9.1, and later judgments of these lines will be consistent. Te 
stimuli Çi.e., the lines) and the responses (i.e., the words short, medium, an 
long) are observable, but the scale is a theoretical cognitive structure hypothe 
sized as a mediator between the stimuli and the responses. It might not a 
necessary to hypothesize this cognitive structure just to account for the 
judgments. But once the s i e if asked, 
draw a line th 
at the bounda 


ubject has organized the scale, he car, pa 
at is of medium length according to this scale, or one tha pa 
ry between short and medinm. He mav or may not have @ 


FIGURE 9.1. Illustrati 
stimuli and the res 
assumed to mediat 


on of a temporary scale adjusted to a series of stimuli. The R 
A z iti r p 

ponses are observable; the scale is a hypothetical cognitive structit 

e communicative responses. 
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image of the scale, but nearly any subject, if asked, could represent the 
scale graphically, by a drawing like Fig. 9.1. 

In the experiments to be described in this chapter, the association between 
the stimuli and the responses is a temporary one, dependent on the temporary 
scale. A different series of stimuli may lead to the organization of a different 
seale and the responses mediated by this scale will be associated with different 
stimuli. But for the time being, the objects of the world are known and 
communication takes place through the medium of these scales. 

Standard scales of inches and ounces are more permanent achievements, 
learned in school and retained throughout life, but their theoretical status and 
their influence on judgment are the same. In many cases, as in the judgment 
of attitude statements for the construction of attitude tests, there are no 
standard scales, only ad hoc scales. 

One might expect that the development of a temporary scale for judging 
lengths of lines would be inhibited by the permanent scale of inches; actually, 
the subjects in judgment experiments do not confuse temporary scales and 
Permanent scales any more than playgoers confuse the temporary and perma- 
nent names of actors. The subjects seem to follow the experimenter’s instruc- 
tions at least as well as the experimenter’s theories follow the subjects’ be- 
havior, 

The implication of this discussion of the nature of scales is that for many 
Purposes judgments can be simply tabulated and directly related to the 
Stimulus objects, but, when difficulties of interpretation arise, the hypothesis 
of a scale as a cognitive mediator should be considered. Some of the experi- 
Mental] manipulations to be described influence the relation between the 
Stimuli and the scale: others influence the relation between the scale and the 
Judgments, 


JUDGMENTS AND SCALES 

Scales mav be bounded or unbounded. Many standard scales, such as 
Scales of length and weight, are bounded by zero at one end but extend in- 
definitely at the other. Most temporary scales are bounded at both ends, as 
when the response categories are restricted to very unsociable, unsociable, 
Sociable, very sociable ta 1, 2, 3, 4, 5, 6, 7, and some of the interesting 
Phenomena ‘of ee occur at the ends. But under some conditions, 
Ounded scales me undesirable. 1f the judge has used the most extreme 


Cate . f hn evi P BY en 
Ategory for the most extreme stimulus he has encountered, and then en 


Ce . m ae Si i 3 
Unters: an even more extreme stimulus, he cannot report his judgment 


d because he has exhausted the avail o 

“culty, Stevens (1956) introduced the unbounded or open-end scale, 
ich | : his own categories of judgment; he 
this procedure magnitude estimation. The instructions ae spect 
€ middle cateoory as 10 or 100, to be used for a specified standar stimulus, 
av ra tes his judgments of other stimuli with 
0, 100, 110, 120. An alternative term, 


able categories. To avoid this 


Permits the judge to choose 


i j i ic? 
Nur eS the judge free to communicate 
Mbers such as 2,5, 10, 20, 40, or 80. 9 
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numerical matching (Stevens 1966a), describes the actual judging operations 
more clearly. 

Unbounded scales have certain advantages for the study of context and 
anchor effects because they reduce end effects: they are nat appropriate, of 
course, for studying end effects. Since the response categories used are vari- 
able, these scales are not suitable for the study of the amount of information 
transmitted by judgments nor for the study of judgment phenomena around 
category boundaries. 

Occasionally, judgments are communicated by assigning stimulus objects 
toa two-category scale, such as short and long or approve and disapprove, 
and by ranking all objects in order of magnitude. In any case, the instructions 
specify the stimulus dimension to be judged, whether length, attractiveness, 
or conservatism, and the terms in which the judgments are to be reported—ex- 
haustively for bounded scales, illustratively for unbounded scales. The terms, 
whether words or numbers, are chosen so that they are familiar to the judges, 
and thus the order is specified in advance. Unsociable is more sociable than 
very unsociable; sociable is more sociable than wnsociable; and so on. In other 
words, the judge begins with an ordered scale. The terms or responses in 
which the judgments are expressed, such as sociable and 4 are names of cate- 
gories of the scale to which the stimulus objects are asigaed, They are not 
unrelated categories, but adjacent intervals of a continuous scale. Hence the 
word interval might be more appropriate, but category has the sanction of 
usage, 

When comparative judgments are made, each stimulus being compared 
kt inc me a ee daer aet 
S ‘ ategories is known as the point of subj 
tive equality and the difference between this point and the standard stimulus 
is a well-known systematic error. Judgments of single stimuli have also been 
complicated by the introduction if a standard or anchoring stimulus— 
Smif EE 
ae and experiments of this type demons oe 

Portance of the relevance of the standard, or the attention directed to it, fo 
the organization of the scale of judgment. 

0, ee ne to be bipolar in nature, Although a scale of -2, a 
econ, Oe ag Racal to a scale of 1, 2, 3, 4, 5, in some a 

arthom seed extends both ways from the zero or neutral aoe hé 
pitve a table A is more easily used. On scales of affective va e 
or unpleasant side a re ie re aye: cea se n a 

è; ell fegin of he aa region feels different from either eee 
ward edie, of ie mperature scale seems to be more than an evi- 
denes ta suppätt of e region. But when we look for guaniiteuvi aa 

€ statements, it is not obvious what kind of evide 


we want, i i i i i id 
been ae we shall see that different kinds of supporting evidence me 
When judgments are made by 


: 5 in 
5 the ranking : ity appears 
the order in which the ng method, bipolarity app 


judge works. Tresselt (1965) asked subjects to rank 15 
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colors, and observed that they began by ranking the first, second, third, and 
so on, to about the eighth, then moved to the fifteenth, the fourteenth, and 
šo on in the other direction. Certainly the first position is definite, the last 
perhaps less so, and both act as poles or anchors for adjacent positions. If the 
subjects were to repeat the rankings, the first and last ranks would no doubt 
be less variable than the middle ones. 

The variety of judging operations which the intelligent cooperative human 
subject can perform with some consistency has led to many arguments about 
the validity of the results obtained. The most generally valid statement that 
can be made is that human judges are extremely flexible: they can abstract a 
Single dimension from multidimensional stimulus materials, conceptualize a 
scale of gradations along this dimension, and utilize many systems of verbal, 
Numerical, or manual responses to communicate about stimuli in terms of 
this scale or, indeed, about the conceptualized scale. 

Arguments about the validity of scales and about the superiority of one 
ver another should not obscure the fact, based on many comparisons, that 
scale values obtained by one method usually are closely correlated with those 
obtained by another. The differences are in the absolute values, in the 
Variabilities at different regions of the scale, and in the shape of the functions 
relating judgments to physical magnitudes. T 

here are no absolutes in absolute judgment or magnitude estimation 
and no necessary connections between the judgment output and the sensory 
input. But the intelligent, cooperative subject can deliver consistent results 
under various conditions, even under changing conditions, and he can, to 
Some extent, as Poulton (1968) put it, “calibrate and recalibrate himself.” In 
Presenting his theory of adaptation level, Helson (1947, 1948) has argued 
that the changes in the observed judgments reflect changes in perception, and 
a fair share of the research to be reviewed has been stimulated by adapta- 
tion level theory. Hence we must return later to some theoretical questions 
abour the mi zaning of judgment data and especially of shifts of judgment. 


METHODS FOR DESCRIBING SCALES OF JUDGMENT , 

A set of numbers from 1 to 7 and a set of words like short a wal oe 
*osely called a scale of judgment, but these are <p called re- 
SPonses, or a response system. To describe a scale of ju one ae to 
“scribe such responses but how they are related to the es 1 ‘hn ma 

ree classes of methods are frequently employed to describe “a scale 
the Judge uses in making his judgments. The observed data are the judg- 
Ments given in response to the stimuli and the instructions; Le scale is a 
YPothetical construct derived from these data. But ay ee j E often 
quite stable, hence this hypothetical construct can be 2j ed with more 
Uantitative precision than most constructs of pye on a i 
; ° avoid confusion, a distinction should be drawn between Cte y of 
Judgment for its ae sake and the technology of scale Sensuucuen) from 
Judgments Th ` ed with description and explanation of 
the j >» The former is concern judgments for other purposes. The 

Judgment process while the latter uses JUCS 
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two vnterprises interact and profit from each other, however. Knowledge of 
errors of judgment has reduced some errors of scale construction; and re- 
search on scale construction has enlightened the psychology of judgment. 


CATEGORY BOUNDARIES IN STIN JALUS UNITS 

If the stimulus dimension is a quantitative one, the boundaries between 
adjacent categories can be calculated in units of the stimulus dimension by the 
method of successive intervals (Guilford, 1954; Torgerson, 1958). The 
transition from one category to the next is tabulated as a cumulative fre- 
quency distribution, then the average of this distribution is taken as the limen 
or boundary between the categories. Fig. 9.2 illustrates an application of this 
method to a twocategorv scale developed by one subject for judging a series of 
weights, and Fig. 9.3 a four-category scale. Fig 9.4 represents an application to 
children’s use of semiquantitative terms referring to various numbers of 
printed characters. The number of category boundaries is one less than the 
number of categories, and the widths of only the internal categories are 
quantitatively described. Some of the important phenomena of judgment T 
boundary effects that can be understood only in relation to the location © 
the category boundarie l 


Judgments in only two categorics permit computation of only one boundary. 
The scale thus described is the simplest possible, but the single ines 5: 
boundary is a neutral level or indifference point of considerable rheoreHee 
significance, like the point of subjective equality obtained from comparative 
judgements with a standard stimulus. 


Percent frequency 


OSN eae Heavy 
—— —— = nee | 
20 30 FNN "T T T 


40 50 60 70 80 90 100 
Weight in grams 
: a 
FIGURE 9.2. Transition zone and category limen for a two-category scale adjusted á 


a of 4 stimulus weights. One subject judged each weight 16 times. The limen 
erween light and heavy was computed as 48 grams. From Johnson (1944). 
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Percent frequency 


Very light Light Heavy Very heavy’ 
90 100 


20 30 40 50 60 70 80 
Weight in grams 
FIGURE 9.3. Transition zones and category limens for a four-category scale. The limens 


between adjacent categories were computed as 24, 47, and 76 grams. 
From Johnson (1944). 


Many hypotheses about judgment are stated in relation to the center of the 
scale, and the central value of a two-category scale is, of course, the boundary 
etween the two categories. Generally, the middle category boundary has 
“en taken as a central value for scales of an even number of categories. 
ther, more elaborate procedures, to be described, have also been used. 


NUMERICAL RATING SCALES 6.7 
any judgments are expressed on a rating scale, such as l, 2, BA, By Oy 

‘ hese have often been called absolute judgments to differentiate them from 
Judgments made in comparison with standard; otherwise the term is mis- 
cading since most of the research on absolute judgments has been directed 
toward their relativity. Another familiar term is judgment of single stimuli. 
“OY some purposes, it is convenient if the rating scale has as many categories 
as there are stimuli. Unbounded scales also have their peculiar advantages. 
“merical ratings are widely employed for practical and experimental 
Purposes, Frequently, the numbers given to the categories are assumed to refer 
to equal intervals on the scale and are then used for regular arithmetical 
manipulations. Thus the above method describes the scale categories in 
Stimulus units and this method describes the stimuli in rating-scale units. The 
tect of unequal intervals on the data obtained in this way has been em- 
Phasizeg and minimized elsewhere, but in judgment research the width of 
“Scale intervals has been studied directly, for example, in relation to 
atitudes of acceptance and rejection of attitude statements. Naturally, the 
Most precise quantitative information comes from temporary numerical rating 


392 

gais few several some | tot 
T T ï 

pone few several | some | lot | many 
r -7 T u = 

Grades 

10 and 11 few several some | lot | many 
T T r 
3 8 13 18 23 


28 33 


JUDGMENT: RESPONSE VARIABLES AND SCALES 


Number of things 


FIGURE 9.4. Quantitative representation of scales used by children to designate y 
various numbers of printed characters. Data from Brotherton, Read, and Pratt (1948). 


scales organized around stimuli that are quantitatively described; ideally, 
when the relation between judgment scale and stimulus series is linear and 
the intervals are equal, the scale can be simply specified as to origin and unit 
in terms of the stimulus dimension (Torgerson, 1958). Helson and na 
stein (1955) have outlined a least-squares procedure for computing a cents 
value, or adaptation level, and other parameters for the case in which a 
logarithmic relation is assumed. 


ADJUSTMENT OR PRODUCTION METHODS ius 

Under some conditions, the subject represents his response scale by adjust- 
ing a pointer, by reproducing a line, or by some other manual adjustment. 
Almost anyone can describe his scale of heights of people by holding me 
hand at a height above which he would call people tall and below which 
would call them short. In one experiment (Johnson, 1946), the subject san 
pennies as close to a wall as he could, then later placed a penny on the floor Pa 
indicate the boundary between the categories worse than usual and better sh 
usual. Fig. 9.5 shows relations between a response scale computed a 
adjustments of a line to a requested angle and the objective stimulus varia i 
This method indicates, for example, that at 20 degrees and 70 degrees 1 
scale corresponds closely to the objective stimuli, while at 30 degrees a 
degrees the discrepancies are relatively large. A recent ingenious vaną e- 
of the old lifted-weight experiment (von Wright and Mikkonen, 1964) a 
quires the subject to adjust a weight to correspond to the medium apt 
In experiments on attitude scales, college students have been asked to re 
statements that fit Category 1 and Category 11 COstrum, 1966). dine 
methods are not always convenient, but when they can be used, they dem 
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strate the varieties of behavior that are related to the cognitive struc 
known as a scale of judgment. 


tion 


on- 


Response Category Bias 
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It has long ie 
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been known that people prefer to make judgments 7 ples 


nek: Iti 
categories rather than others. Census reports show a preference for mu 
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of five and for even numbers in reports of age. When sentencing criminals, 
judges prefer round numbers, as do psychologists and other scientists reading 
laboratory instruments. The effect of a preference for any category is to in- 
crease the width of that category. In reporting age to the nearest birthday, for 
example, if the response scale were uniform, the category 49 would be used 
for ages between 48.5 and 49.5, the category 50 for ages between 49.5 and 
50.5, and so on, each response category being one year wide. But if there is a 
preference for 50, this category will be used by some people of age 49 and 
Some of age 51. Hence the category 50 might extend from 49.2 to 50.8, 
thereby attaining a width of 1.6 years, while the 49 and 51 categories shrink. 
Response preferences are particularly noticeable when the categories are des- 
ignated by semiquantitative terms, such as very and medium. When the wit- 
ness tells the jury that the man who stole the money was of medium height, 
what does the label medinm mean? 

Some quantitative evidence on this question was collected in an attack 


reg URE 9.5. Relation between stimulus ~~ ees siege a 
i j j ked to a i1 
Sg, 300 fae hod: The ajea ae was 28.6°, as shown a =a ge When 
jude called for, the mean adjustment was 4.5°- Then we 6 ous and 2,87 
Sments. Data from Kaufman, Reese, Volkmann, and Rogers . 
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on an old question of psychophysical methodology. In comparative aon 
should the subject be restricted to two categories, such as shorter and lor i s 
or should he be permitted to use a middle category also, such as iega i 
doubtful? Early discussions of this question dealt with the validity ‘ of ou: i 
tive reports of the impression of equality. But when this ee a 
aside and the equal response was treated as the middle category of ath i 
category scale, the question became amenable to objective ae = 
a weightlifting experiment, Fernberger (1930) showed that when si ie 
structions emphasized finding a difference between the standard and pate e 
weights, the width of the middle category was reduced to 2 grams: but X sien 
the instructions permitted free use of the equal response, the middle category 
attained a width of 10 grams. at 
Response bias appears also in judgments of single stimuli. A class esp 
ment in a laboratory course at Michigan State Unive ty requires judge i 
of a graded series of eight gray papers as light, medium, and dark. The oe. 
data from one group of ten students gave evidence of a middle category oe s 
3.1 to 6.4, a width of 3.3 in the units of the stimulus series. For another r 
of ten, told that medium was a nondiscriminating response to be avoided, 
middle category extended from 4.0 to 5.4, a width of 1.4. f i By 
Either end category of a three-category scale can also be widened >} 


; $ had 
response bias resulting from differential reinforcement. Le Furgy Su the 
è ) 


ther 


a 


a series of circles judged positive, neutral, and negative, according t 
reward expected. But whether one end of the stimulus series or the © 
was reinforced, the positive category was the widest. | 
Unplanned category biases are not always as large as in these par p 
lustrations, but the effects often interact with other effects to make interp" to 
tion difficult. Individual differences in the use of a scale are often ene 
habitual tendencies to stress or to avoid certain categories. Instructions pa 
lead one judge, whether deliberately or incidentally, to attempt to “ronisidet 
category of the response scale equally often, while another may not €O 
this requirement. 
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culations will usually show that the boundary between the two ting scale 
occurs near the middle of the stimulus series. When a numerical ratis iler 


is used, the larger numbe 
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rs are assigned to the larger stimuli and t ponds 4P 

numbers to the smaller stimuli, so that the judgment scale corres} lled ° 
proximately with the 
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SE ILLUSTRATIONS 

The context effect, like many others, was discussed under one name or 
another on the basis of subjective observations before objective data became 
available. Around the turn of the century, according to Woodworth (1938), 
German investigators reported that after a series of weight judgments the 
observer was no longer comparing the lifted weight with the image of the 
Standard weight but with some absolute impression of the whole series. 
Similar phenomena were noted in studies of pleasantness and unpleasantness 


in the 1920s and summarized by Beebe-Center (1932) as “mass contrast” 
Later, Hunt and Volkmann 


rather than contrast between individual stimuli. 
(1937) made use of such results on affective judgment to state a general 
Principle of judgment. 

he context effect showed up in anothe 
found that a cadet has a better chance of passir 
is grouped and checked out on the same day with cadets of low aptitude than 
with cadets of high aptitude (Krumboltz and Christal, 1957). The effect has 
turned up also in ratings of foods by a panel of housewives (Fine and Hag- 
Sard, 1958), in judgments of photographs of applicants for admission To 
Staduate school in psychology (Lewy, 1960), in judgments of the tensy 
of electric shocks (Bevan and Adamson, 1960), in judgments of short descrip- 
tions of aggressive behavior (Miller and Bieri, 1965), and in the scaling of 
Social objects by several methods (Hicks and Campbell, 1965). Judgments of 
i of photographs of faces expressing 


r form when aviation psychologists 
ng the primary flight test if he 


the ale 4 ‘ 
; € pleasantness of a standard series A 
“motions were shifted toward the pleasant end of the scale when judge 
Within the context of unpleasant verbal descriptions and toward the un- 
Pleasant end within the context of pleasant descriptions (Manis, 1967). 
© context effect is «ct a matter of repeating Judg S ady ; 
_ The context effect is not just a matte f repeating judgments already made 
it extends beyond the typical judgment situation. When children were asked 
to take a few beads from a trav, or some of them or @ lot of them, the number 
at took under each instruction was influenced by the context en BS 
£ beads in the tray (Cohen Dearnley, and Hansel, sere (see ig A ) 4 
Series of ; z -g oe as large or small determined the size 
of th a aigles presented for es all larg periment, the size of the 
the angles drawn later, just as. in another experiment, 
angles drawn determined later judgment of angles (Jennings and Johnson, 
- A reproduction method, modifying the familiar lifted-weight experi- 
ht and Mikkonen, 1964). The 
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FIGURE 9.6. Context effect and number concepts. When children were asked to take a 
beads out of a tray, the meaning of concepts such as some of them depended in part 
the context furnished by the number of beads in view. Data from Cohen, Dearnley. 


and Hansel (1958). 


obvious; an act of judgment often precedes the planning of the next actio 


and such judgments also are subject to context effects. 


QUANTITATIVE ANALYSIS 

There is no doubt about the generality of the context effect, but if w 
more detailed questions, quantitative research is necessary, for whicl A 
simple two-category judgment is a convenient introduction. As an illustration, 
when a series of 17 weights ranging from 20 to 100 grams by 5-gram in 
was judged repeatedly in two “categories, light and heavy, the sarge 
limen was 48 grams (Johnson, 1944). This value is near the middle of t : 
stimulus series, to be sure; but the exact middle is 60 grams if it is defined j 
the midpoint between the two ends or as the mean ar medan of all welg 


NC . is i gen” 
hence the obtained value is lower than any simple prediction. This is 2 8 i 
eral result of sev i ale 


e ask 
h the 


Á eral such experiments, which suggests that the s¢ 
organized from some decreasing function of the stimulus magnitudes. tar 
logarithmic function is cominanky assumed for weights, leading to comply a 
tion of a geometric mean, which in this case would be 54 grams, a some is 
better prediction. Linear functions fit the data of some experiments ae 
kind, and in others logarithmic functions or power functions are better 
pending on the stimulus objects and the receptor system. 

The typical judgment experiment presents cach stimulus equally 
So: that the context of judgment is a rectangular distribution of 
Distributions with the same range, but of positive or negative skew, PY 
different contexts. When the above series of weights was skewed posit!” 


s often 
stimuli- 
prov? i 
ely» 
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N 


bv presenting the lighter weights more often, the category limen was 30 
grams, and with a negative skew it was 70 grams. This effect of skew is a 
general one, but distributions of other shapes have also been arranged, as we 
shall see later, in order to analyze the details of context effects. 

When the stimuli are sounds of various frequencies to be judged for pitch, 
a logarithmic receptor function is commonly assumed. One experiment with 
seven series of sounds covering much of the audible range included skewed 
distributions of stimuli and computed the geometric means of the frequencies 
to predict the boundaries between the judgment categories low and high 
(Johnson, 19492). Fig. 9.7 shows that the values predicted in this way come 
close to the obtained values. This figure also displays the large magnitude of 
the context effect; the frequencies of Series F and G, judged low, were ob- 
jectively much higher than those of Series A and B, judged high. 

Comparative judgments, as when stimuli are judged lighter or heavier 
than a standard stimulus, are not immune to the context effect. Data obtained 
by the method of constant stimuli indicate that the reference for the judg- 
ments is not the standard stimulus alone but the whole series of stimuli as 
Well. The data can be predicted under some conditions by computations 
that weight the standard stimulus one-half as much as the series stimuli 
(Johnson, 1945) or one-third as much (Helson, 1947). The effective weight 


of the standard is presumably a consequence of the attention it receives and 


may reflect subleties of the instructions and experimental arrangements. 
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°m Johnson (1949). 
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COMPLICATIONS AND LIMITATIONS 
The context effect can be quantified most accurately when judgments 
of different series are made by independent groups. If the same sas 
judge different series, the change in the scale of judgment indicates the 


direction of the shift of context: but the effect is complicated because the ad- 
justment to the second series is retarded by experience with the first. The 
effects of such shifts in the stimulus series deserve careful study in their own 
right in later pages. 

ation of the stimulus series from 
other stimuli. The context does not include all the similar stimuli that the 


The context effect depends on the seg 


gi 


judge can remember, and if his previous memories are strong and relevant, 
a simple calculation of the context effect will not be adequate. | lelson’s ( 1964 j 
quantitative formulation of these phenomena, which explicith provides ro 
the residual influence of previous experience, will be discussed later. But as 


a general observation, [lelson has stated that “sensory processes arc usually 
less subject to the influence of residuals from past experience and imagination 
than are affective and cognitive processes [p. 371 ]." 

If someone judges lengths of lines on a well-learned scale of inches ue! 
extends from zero to infinity, there is no reason to expect a strong contest 
effect because there is no limited lus 


ad stimu 
scale to be matched to a segregated stimt 
series. Even so, when the 


of context 


series of lines is limited, some evidence is 

š , Š è d oras 
effect can be obtained (Krantz and Campbell, 19610. Judgments of wo 
contes 


on the semantic differential seem to be particularly resistant to the 
effect (Sommer, 1965), and other exceptions will be noted in the section OF 
attitude judgments. i 

The response system by which the judgments are communicated also is 
troduces variations. The context effects that show up when unbounded scales 
are used (Hicks and Campbell, 1965; Ross and Di Lollo, 1968a) are ca 
parently smaller than when category scales are used—although, since mA 
scales differ, comparisons are difficult. In one experiment on weight jies 
ments (Ross and Di Lollo, 1968h , the use of a twocategory scale shows 
3 adjuste 
o 600 


tween 


rather complete context effects in that the judgments were closely 
to the series presented, whether it was from 100 to 300 grams or 400 t 
grams. But the use of a six 


category scale disclosed some separation be 

a n Gta, = . . f we y 
the two series because, as the authors put it, the “locations of the live lii is 
aries defining the six e Cone 


ee he 
criminative 


unc 


ategorics are determined partly on relative ni 
ka i : i sits 0 
considerations and partly to convey absolute judgments 
series as a whole [p. 550].” 
In a general Way, since 


Be F „simulis 
the scale of judgment is anchored to all the s 
the context effect could 


ye effect- 
i stimuli 
he shift 
ater 


be considered a comprehensive anchor 
But the term anchoring is typically reserved for the effects of speck 
and for the effects of experience with a series oF stimuli i retarding t 
of the scale of judgment, effects that will be svstematically considered y i 
Although the phenomena outlined above have been manifested in seve 


WF a rent uses 
ways and have gone by se 


veral names, the name context effect, in cur 
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is the name of choice. It is not easily confused with other terms and is 
descriptive rather than interpretive. 


The Central Tendency of Judgment 


The early work by Hollingworth (1909) on inaccuracy of movement em- 
ployed a reproduction method. His subjects had the task of reproducing the 
extent of a guided hand movement by drawing a line on paper. Three series 
were used and it turned out that in each the reproductions of the larger 
distances were too short and the reproductions of the shorter distances too 
long. Hollingworth spoke of a “central tendency,” by which he meant that 
the constant crror of adjustment was toward the center of any series. The 
same tendency appears in reproductions of short temporal intervals CTurchoie, 
1948). In tracking tasks also there is a tendency, called a range effect, to 
overshoot small inputs and undershoot large inputs (Slack, 1953; Weiss, 
1955), 

Fhe central tendency can be observed equally 
are underestimated and the smaller 
ts of areas 


often in judgments or 


ratings: the larger stimuli in any seri 
ones overestimated. This has been demonstrated for judgmen 
Hollingworth, 1910), of lines (Ipsen, 1926), and of weights (Noizet, 1967: 
Woodrow, 1933), The effect can be illustrated most clearly when the 


rating scale has the same number of categories as there are stimuli, as in 


Philip's (1947) study of judgments of cards with green and blue dots exposed 
briefly in a tachistoscope. All cards had 36 dots, but Stimulus No. 1 had 13 
o. 2 had 14 green and 22 blue, No. 11 had 23 green 
presented repeatedly for 


Steen dots and 23 blue, } 
and 13 blue. When this series of 11 stimuli 
Judgment of greenness Cor of blueness) on a scale of 1 to 11, a central 
tendency appeared sys ematically at both ends (see Table 9.1). Median 
| as 2.9 rather than 1, and median 


ju ‘ > f 
l dgment of the stimulus at the low end w 
judgment of the stimulus at the high end was 9.5 rather than 11. The judg- 
on could not go lower than 1 nor higher than 11, hence the distributions 
ois § £ l 
Judgments near cach end were skewed toward the center and means show 
H central tendency than medians. 1 he category boundaries at the extremes 
the scale of judgment were indeterminate because the extreme ratings 
he middle categories. Thus the standard 
distribution of 11,550 judgments was 2.89, 
Central tendency is not limited to this 


were us 
re used less frequently than t 
deviati r E . 
i Viation computed for the entire 
as © “ cn 
Compared with 3.16 for the stimuli. 


TABLE ` g 
ABLE 9.1, MEAN AND MEDIAN JUDGMENTS OF CARDS CONTAINING GREEN 


AND BLUE DOTS 


Stimulus number l 2 3 4 5 6 7 8 9 10 11 
ei rating 33 3.7 4.3 4:7 ag 63 70 7A 78 83 90 
Median rating 29 34 4.1 as 47 63 71 77 81 8.6 95 


Dat: 
ata From Philip (1947). 
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type of rating scale, however; it has also been reported for esthetic judgments 
obtained by the method of paired comparisons (Kennedy, 1961 

Central tendency depends, of course, on the context effect. It is only when 
the stimulus series is the context of judgment that the ends and center ol 
the judgment scale can be expected to line up with the ends and center of the 
stimulus series and, therefore, that overestimation and underestimation have 
any meaning. In Table 9.1 context effect shows in that the scale of judgment 
corresponds quite well with the series of stimuli. The middle stimulus, No. 6 
gets a mean rating of 6.3, and central tendency is roughly the same at both 
ends. When such correspondence occurs, central tendency usually occurs. 


Anchoring Effects of Special Stimuli 


The context effect gives evidence that the scale of judgment is tied in : 
generalized way to all stimuli presented for judgement. In addition, a ow 
category of the scale may be tied to a special stimulus, which is then ww 
as an anchor stimulus. For example, the smallest stimulus may be er 
identified as such and alwavs judged a one, thus anchoring the judgment scale 
to the stimulus series at the small end. Both ends are often anchored hëcause 
the smallest and largest stimuli are identifiable. If a standard stimulus n 
presented with instructions that it is to be called a one, it will have the aia 
anchoring effect. Similarly, if a standard stimulus s presented with iaee 
tions that it is to be called a ten and is to be the origin from which magnitue 
estimations are made, this standard also is an anchor in that it ties a spec! 
category of the judgment scale 


to a special stimulus. talis 

The most unambiguous evidence for anchoring is that the anchor mo sih 
is judged with less variability than other stimuli. If the experimen Jct 
rangements permit a me oe 


asure of error, the anchor stimulus will be ? 
they 


with less error than other stimuli. But when anchor effects are strong 
í 2 z oj E 
spread to adjacent stimuli. V; as the ancho 


ariability and errors decre: s f the 
approached, verifying the special role of the anchor in the organization © 
scale. 


is 


nulus 
eries: 


Anchoring effects occur 


4 3 PE E > stin 
when the anchor stimulus is within the $ 
series and also when 


aa i i s 
it is extended bevond the end of the stimulus 


INTERNAL ANCHORS 


A good illustration of 


i 5 c: u of the 
internal anchoring comes from a study 
inclinations of lines (Kauf 


man, Reese, Volkmann, and Rogers, D he 
which Fig. 9.8 was drawn. The graph shows that the variability ‘ grees 
adjustments was especially small at and near inclinations of 0 and 90 deg" 
from the vertical, The same effect has been demonstrated by ag , 
the method of reproduction (1951a) and the method of estimation a9 

The vertical and the horizontal are natural anchors. 
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Standard deviation 
of adjustments in degrees 


=10 0 10 20 30 40 50 60 70 80 90 100 
Stimulus scale in degrees 
and horizontal. Variability of 


scale is tied to the stimulus series 
eese, Volkmann, and 


FIGURE 9.5. Effects of natural anchors at the vertical 
adjustments is small at 0° and 90° because the response 
fe these points. Same data as for Fig. 9.5 from Kaufman, R 
Rogers (1947), 


MIDDLE ANCHORS 

In order to demonstrate how the vertical can 
temporary rating scale, Bourassa (1961) had 21 lines, 
aa to 95 degrees from the horizontal, judged 5 times each on a scale of 0 to 
20. Taking the reduction of variability as evidence of anchoring, Fig. 9.9 
drawn from the data of 1 subject, shows negligible end anchoring but 
definite middle anchoring. In judgments of loudness by the method of magni- 
tude estimation, the siandard acts as an anchoring stimulus for the middle of 
the scale and the variability of judgment increases gradually on both sides 


of this standard (Stevens, 1956, Fig. 1). 


anchor the middle of a 
inclined at angles of 


END ANCHORS 
he anchoring effect most frequently reported is end anchoring. The ends 


of the judgment scale are linked to the ends of the stimulus, with the re- 
sult that judoments of the end stimuli are made with fewer errors and less 
Variability ian judgments of the middle stimuli (Volkmann, 1951). The 
Cffects spread ead hence it is reasonable to assume that the stimuli near 
each end are ‘dyed dn relation to the end stimuli, and that the scale as a 
Whole is Gieaniwed from both ends toward the middle. End anchoring is most 
Noticeable in indomenis made on a rating scale of fixed categories, such as 

to 7 of <3 a a es negligible in judgments on unbounded scales, 
When Fel e oh aie middle of the stimulus series and the scale is 
Organized from the middle outward (Weiss, 1963a). Something similar to 
end anchoring occurs in rankings, howevel as noted above, if we take the 
°rder in which the judgments ait made as evidence of the order of organizing 


© scale, 
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Judgments 


85 87 89 91 93 95 


Stimuli in degrees from horizontal 


FIGURE 9.9. Middle anchoring without end anchoring. One subject judged the i 
inclinations of 21 lines, 5 times each, on a scale of 0 to 20. The vertical lines show the 
range of judgments, and the dots show the means. Middle anchoring appears in 

the reduction of variability near 90°. Data from Bourassa (1961). 


: i) d ry: ‘oportions by 4 
End anchoring appears clearly in judgments of linear proportions >) 


reproduction method also. Nash (1964) presented his subjects a veiis 
line, about 8 inches long, and asked them to mark the deciles, as by in| 
an X at the 90 percent point. The results showed that the scales were ee 
at both ends, because the 10 percent and the 90 percent points were mat j 
with high accuracy, and also the middle, because the 50 percent point SE 
marked with high accuracy. The greatest variability was at 30 and 70 perce! 


SALIENCE OF ANCHORS re ends 

Why do the ends exert such special effects? Is it because they are a 
as such or because they are salient? Eriksen and Hake (1957) eliminated a 
by the use of colored Papers in an endless continuum and found that t® se 
subjects then selected other salient features, such as the end of the ee fot 
scale, as anchors. After considering alternative explanations, they aati to 
the hypothesis that the subject selects any salient feature that helps ait nt 
stabilize his judgments and interpreted end anchoring as a special case 
this. jmulus 

End anchoring, though common, does not occur at the ends of all ate and 
series. Although the subject is told the stimulus dimension to be judged š 
probably assumes that the stimulus series is a limited segment of this din pe 
sion with two definite ends, the identification of these ends comes se ers: 
perience with the series and some ends are more easily identified than i 
One systematic study of end anchoring (Johnson and King, en oreen 
slides of 36 green and blue dots, ranging from 0 green (36 blue) to oe Oi 
C0 blue), projected on a screen, because this combination of ee to 
yields linear data. Seven sets of 1] slides each were drawn from this "The re 
form different stimulus series and were judged on a scale of O to 10. 


en 
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sults, in Fig. 9.10, show that each monochromatic end acted as a strong anchor, 
but in Set 4, with ends far from the monochromatic ends, no anchor effect 
appeared. Strong end anchoring may be expected when and only when the 
end stimulus has some salient property that identifies it as an end beyond 
which there are no other stimuli. Weaker effects may be expected when the 
ends are less salient. Fig. 9.10 also shows the central tendency of judgment 
(Set 4) and the reduction of central tendency by end anchoring, with a 
concomitant increase in overall accuracy. 

Adding value to a stimulus object may have several consequences, but one 
of interest here is that it makes that stimulus more salient. Tajfel (1959) had 
a ten-category scale, but when he gave a bonus 
d or the other, he found that adjacent 
throughout the series and the overall 
as not due to a general increase in 


a series of weights judged on 
for judging the weight at one en 
stimuli were separated more clearly 
range of judgments was extended. This w 
Motivation, because the effect did not occur when bonuses were given for 
Stimuli at random. According to Tajfel, value at one end of the series made 
both ends more salient. 

The salience of an end stimulus depends 
stimuli, End stimuli in series of lengths and in series of descriptions of 
aggressive behavior were judged with lower variability when they were the 
ends of a widely spaced series of eight stimuli than when the same stimuli 
were the ends of series of eight stimuli of half the range (Miller and Bieri, 
1965). And if the judges are led to believe that the ends may change, the 


also on the spacing of the series 
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choring. Seven sets of 11 stimuli each were 


j re shown for each distribution of 60 
Judged or Sais Ant nartiles are i j 
mu mena Ee 1 gee ee stimulus at one end of yrs stimulus series, 

i . Sets 1, 2, 6, and 7, 5 z f A $ 
Show definite end anchoring. Set 4, without identifiable end stimuli, shows no 


a P 
”choring, From Johnson and King (1964). 


GURE 9.10. The conditions of end a" 
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= . Anch 
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A a < 
= 
S 4 900 -grm. Anchor 
5 FIGURE 9.12. Effects of remote anchors 
v3 on weight judgments. In relation to the 
3 2 control condition with no anchor, a i 


90-gram anchor raises all judgments, 0U 
especially at the low end. A 900-gram _ 
o anchor lowers all judgments, but aie 
i f 
200 250 at the high end. The curves represe? 
. 209 So AOD equations derived from oe 
Stimuli in grams theory. From Helson (1964). 


ally 


perceived as columns; the matrices were perceived as rows when a column 
anchor was presented. Other experiments of the same design demonstrate 
similar effects on the Gestalt principles of similarity, good continuation, en 
closure. f i 

If the anchor stimulus is extended too far beyond the series stimuli, its “nd 
fluence breaks down (Helson and Masters, 1966; Sarris, 1967); either a 
cause it does not receive much attention, or because it is not perceived a É 
part of the series, perhaps not even as relevant to the task. When Sarris m 
his subjects judge weights from 200 to 400 grams, with anchors ranging en 
12.4 to 4,500 grams, he obtained the usual anchoring effects, increasing in 
distance from the series stimuli, for anchors between 90 and 3,000 grams- 
outside these limits the effects of the anchors decreased. ori’ 

The most extensive data on variations in anchor effects come from on 
ments on pitch judgments by Sarris (1969), using rather narrow te 
frequencies and a wide range of anchors. For the middle curve of Fig: cles 
for example, the stimulus series extended from 500 to 600 Hertz, O! a 
per second. Anchors as low as 200 pulled the scale down—as measure 
the adaptation level or central value of the scale—and anchors as high 3° ta- 
pulled the scale up. Hence within these limits the relation between adap - 
tion level and anchor is linear. But beyond these limits anchor effects 
crease at both ends, so that the effects of anchors below 50 and above ~” the 
cycles per second were negligible. Similar inflections are apparent i 
other two curves of Fig. 9.13. ffe 

The pure effect of an external anchor, uncomplicated by other it the 
comes out when all stimuli, including the anchor stimulus, are judged me 
Same way and the scale adjusted to include the anchor. For the ae 10) 
illustrated in Fig. 9.14, the stimulus materials were chosen (as for PiS d be 
so that the relation between response scale and stimulus series WOU 


i t 

l j % , that 
co The judgments made by five independent groups show just 
cale is adjusted as a whole to a series that includes a remote anchor, 9.12 


it i i : s Fig 
is adjusted to a shorter standard, or unanchored, series. AS iP °°? 
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from Helson, the difference between the judgments is greater at the end 
of the standard series which is closer to the anchor stimulus. 

The effect of an external anchor can be called a contrast effect, since the 
end stimulus might be small relative to the standard series and large in con- 
trast to the anchor stimulus. Or, to describe the effect more generally, when 
the scale of judgment is adjusted to the standard series, the end stimulus is 
at the end of the scale. But when the scale is adjusted to include a remote 
anchor stimulus, the end of the standard series is no longer at the end of 
the scale and receives a less extreme scale value. 

When the anchoring experiment is simplified in this way, it is apparent 
that one function of the anchor is to extend the context of judgment; if the 
anchor stimulus is judged, the anchoring effect is a special manifestation of 
the context effect. But an extended anchor is also a very salient stimulus, 
Precisely because it is extended beyond the others. Hence variability of judg- 
ment of the anchor stimulus is usually negligible. 


Since the effect of an unjudged anchor breaks down at extreme distances 
fect of a judged anchor also break down? 


ct to judge the anchor may 
h the other stimuli that are 


from the other stimuli, will the e : 
One can only speculate, but requiring the subje 
make it seem less remote, perhaps uniting it wit 
the objects of the same judging operation. 
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inear, exhibits the familiar anchor effects; saak AE") 
ae ed in either ees inflection appears ne <a effects are 
ero. From Sarris (1966), permission Verlag für Psy D 
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FIGURE 9.14. Judgment, z 
of a standard series of weie its 
and 4 anchored petit 
oint represents the mean 
b4 judas The ee 
in each graph shows the oe 
regression equation eo 
‘om all judgments. f 
pee line of the standard 
series is reproduced as 4 
broken line on the other 
= " ‘i 3I 35 graphs. From Johnson ane 
Stimuli Mullally (1969). 


Series 35 


IMAGINARY ANCHORS 


The anchor stimulus need not be a stimulus that the judge perceives: vai 
has been demonstrated incidentally by many experiments directed OWA ee 
other Principles. When Blumenfeld (1931), in one of the earliest experime” a 
on anchoring, had his subjects judge the irregularity of scattered dots on 2 
card, the judgments indicated that the subjects took zero scattering all Pe 
lying in a straight line) as an anchor even though they did not see such . 
card. Hunt and Volkmann (1937) observed that judgments of the pee 
ness of colors can be influenced by reference to an anchor in the form oh 


x k act a5 
most pleasant color one can think of. The vertical and the horizontal a 
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anchors for judgment of lines even though no vertical or horizontal line is 
presented. And in the Johnson and King experiment Cpage 402), the anchor- 
ing effect of a monochromatic slide was apparent even when this slide was 
not included in the series to be judged. It is easy for anyone who sees a 
slide with two green dots or one green dot to imagine a slide with no green 
dots and use that imaginary slide to anchor his scale. 


Realignment of the Scale of Judgment 


We have seen that the alignment of the scale of judgment with the series 
of stimuli is influenced by the stimulus context and by special anchor stimuli. 
It follows that the seale can be shifted up and down, or realigned, by manipu- 
lation of these stimuli; such shifts have also been called context effects and 
anchoring effects. The term contrast is used in addition because a stimulus 
that is judged small in contrast to the first series may be judged large in 
contrast to the second. The resemblance of experiments of this kind to ex- 
Periments on learning is obvious. 


JUDGMENT AND LEARNING 
Judgments are learned. This is one feature that distinguishes judgments 
from reflexes and changes in a scale of judgment from sensory adaptation. 
Unlearned components are involved in perceptual and affective judgments, 
ut the Judgments are communicated from one person to another by a 
System of response categories the meaning of which has been learned in 
other social situations. The communicative significance of each response 
Category of a temporary scale of judgment develops quickly, and the typical 
Judgment experiment, in the psychophysical tradition, presents the similis 
Series a fei times for practice, then discards the practice data. The judge is 
©Xpected to achieve a stable adjustment to the situation and to the stimu 
Series rapidly, so that the data from subsequent series can be combined. The 
adjustment does occur quickly if the stimulus series is constant, but it does 
“epend on practice in judging the first few series. P 
© say that judgments are learned is a very general statement—as genera 
f Jug u ot as helpful as one might ex- 


s usually ignored the learning 


as og TE ati 
s the definition of learning itself—that is n 


Pect. Res : : learning ha 
© Research a ssion of learning ae 

OF scales a and discus i A k is ambiguous, the definition of correct 
es. Since the subject's task i s 


a ponse, or of confirmation, is also ce: f agen a p eel an 
enes is shifte eating the adjustmen š : 7 i 
Series of i irons while it does not explain payning, E a 
Variables that can be examined, hypotheses ihat rat ve soit oo - 
Conditions that can be included. The possibilities of ge c 
tice wil] be considered later under the haiga apen ie a ti Te 
Ne basic question is how anyone, TS of lea ing a new 
Stimulus context, readjusts to another. It is not a qein of le m g W 
imension bit of a quantitative shift along a familiar dimension. If someone 
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is accustomed to thinking of $1 as the limen between an spenden SERS) 
and an inexpensive lunch, how does he realign his scale of values re a 
inflationary rise? What will happen to a student's scale of the one ne 
offenses if he loses contact with the peccadilloes of college life and go 
ork in a state prison? es ee 
Y The most peat picture of a realignment comes from examining, judg 
ments of all stimuli. But when comparisons are made between pre- and. pi : 
shift judgments, the picture becomes complicated; hence several simplifications, 
depending on the feature of most interest, have been introduced. san. if 
The shift can be followed continuously and plotted with some aan 
the stimuli are judged on a two-category scale, so that the limen between $ 
categories can be computed as the central value that locates the scale e 
tracks its movements in stimulus units. When the scale has three or 
categories, the mean of the stimuli assigned to the middle category, ° 
labeled medium, can be treated as a central value. m) 
When judgments are made on a numerical rating scale, least-squares Pe 
cedures provide a central value or scale origin for each block of judgme uli, 
For some purposes, it is not necessary to analyze the judgments of all ence 
Shifts of a rating scale can often be studied merely by following the "° ae 
of what Krantz and Campbell called a tracer stimulus, that is, a stimu i f 
is included in all series but may be, as the series change, at the high en 
one and at the low end of another. aiet 
It is a reasonable hypothesis that the lag in adjusting to a new si aT oe 
related to the amount of experience with the old series, and in waa this 
earliest experiments of this type Tresselt (1947) was able to con H kas 
hypothesis for weight judgments. Since that time, most of the resear 


: . z impede 2& 
been directed toward those variations of the preshift series that imp 
justment to the postshift series. 


LEARNING CURVES . in two cate 

When the stimuli of the old and the new series are judged in describe 
gories, so that one category limen locates the scale, the shift can be case O 
in terms of movement from the old limen toward the new. In pri in 
judgments of the pitch of sounds, we know from the research P ș of the 
Fig. 9.7 that these limens are approximately at the geometric ont est 
stimulus series. If we designate these two limens as Lı and Le, t whe Jimens 
expression describing the shift would be a weighted average of the tw 


Johnson, 1949b). 
i wL, $ nL 
. wen 


P timu 
A n n aries of s 
In this equation, n represents the number of times the second seri y 


, and bi 
has been presented for judgment. L» is weighted by n, therefore, ane 


A sa im 

sai z R ift series I? 
w, an empirical constant indicating the weight of the preshift en i the 
peding adjustments to the postshift series. L, is the category series: 


A z i e new 
time of the nth practice trial, that is, the nth presentation of th 
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The expression could be considered a moving weighted average. It begins at 
L, and takes the form of a hyperbola with Lə as an asymptote. 

When experiments were arranged to test this equation, the curves drawn 
in Fig. 9.15 were obtained. They show that a hyperbolic function is not in- 
consistent with the facts, although the data are not sufficiently regular to 
tule out other similar functions that approach an asymptote. More important, 
these graphs show that w, the constant representing the weight of the first 
series in retarding adjustment to the second, increases directly with an in- 
crease in the number of preshift trials. Similar effects of number of preshift 
trials have been reported by Di Lollo and Casseday (1965) for weight judg- 
ments, 

The union of the judgment tradition and the learning tradition raises 
questions about the proper name and treatment of their offspring. Is a trial 
the judgment of a stimulus or is it the judgment of a series of stimuli? Only 


YL 
p Øy 


w=0.65 
Fig. 3 
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judgments of complete series are comparable, but considerable irp eee 
occur during judgments of a series of five stimuli. Rambo anc Di a 
C1964) therefore defined a trial as the judgment of a single mn A 
presented the stimuli in different orders to different subjects so t mt T 
stimulus was judged first by some subjects. Thus a central value of the m 
could be computed for each trial, beginning with judgment of the a 
stimulus. In two experiments on judgments of dots by this more precis 


A 5 a ~ rse of 
procedure, they found that a hyperbolic equation described the cours 
stabilization of the scale quite well. 


VARIABLES THAT INFLUENCE LEARNING . E ection 

The familiar question, What is learned? can be raised in this sae ie 
also; Parducci (1954) began with the hypothesis that it is mainly tan oa 
tribution, and redistribution, of stimuli that is learned. In an gr i eto 
judgment of distances, one group judged a block of short distances, then al 
distances, and finally the block of short distances again. The experim® a 
group judged the same distances in the same order, but they were an 2 
in advance about the range of distances to be presented. Consegue fast 
expected, the experimental group showed less shift between the first an 
judgment of the short distances, 

Another effect familiar in learning rese 
up in Parducci’s experiments on ancho 
squares and the height of cardboard men 
influenced more by immedia 
earlier, 


arch, the effect of recency, ae 
ring of judgments of ee a 
; judgments of standard series id 
tely preceding anchors than by anchors judg 


Most experiments on seri 
tracted an e 


as 
range of the stimuli, the frequency of apa 
and the middle of the series, by incidental lea ved 
dimension. The effects on the dimension ca 
ere not as large, of course, as the effects on the dimens 


Some series were skewed—, 
ing while judging another 
incidentally w 
judged. 

These trends wer 


; by 
i e confirmed and additional controls were included 
Di Lollo (1964) in a systematic ex i 


shifted from a heavy series of weights to a lig > 
Z inued judging the heavy series in both training 
phases. A third gro 2 


ments, for both groups that shif 
with the appropriate control i 
but did not vanish; the convergence can be attributed to recency and the /4 
of complete convergence to primacy. 

As interest in shifts of judgm 


are 
$ ompa 
ted, showed considerable contrast as COMP 


ase 
groups. These differences gradually dmn c 


«ions 
ditio” 
ents has grown, more and more con 
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influencing such shifts have been reported. Subliminal anchors seem to in- 
fluence judgments of the intensity of electric shocks (Black and Bevan, 
1960) and of the loudness of tones (Bevan and Pritchard, 1963a). In judg- 
ments of the size of circles, imaginary anchor stimuli are about as effective as 
anchors actually seen (Magaro, 1966). A two-hour lecture about contrast 
effects in judgments reduced anchoring effects in weight judgments, as did 
hypnotic suggestion to ignore the anchor (White, Alter, Snow, and Thorne, 
1968). Although several narrow hypotheses have been offered for the effects 
of such anchors in retarding adjustment to a new series of stimuli, the diver- 
sity of the conditions investigated suggests that any condition that influences 


perception of a stimulus can influence the effectiveness of that stimulus as an 


anchor. 


VARIATIONS IN THE RESPONSE SYSTEM 

Campbell, Lewis, and Hunt (1958) observed that experiments on realign- 
ment of the scale of judgment utilized temporary rating scales specific to the 
laboratory and bounded at both ends. They therefore displayed a simulated 
Piano keyboard, well labeled, for their subjects to use in making pitch judg- 
ments of series of musical notes. The diagrammatic keyboard represented the 
full range of the piano, while the stimulus series were located in the middle 
Tange; hence the subjects’ choice of response language was not restricted at 


either end. Under these conditions, the mean judgments of a tracer stimulus 
‘tional series, and in the last 


included i ‘es of stimuli, in transi 

Series Tonite ade due to shift of context. It was thus demonstrated 
that these shifts are not peculiar to the temporary bounded scales previously 
used, 

i Krantz and Campbell (1961) then tri 3 
engths of lines with two other response systems: a temporary rating scale 
With a center at 100 and undefined ends, and a well-learned standard seale, 
the scale of inches. One series consisted of lines from 6 to 20 inches e» 
another of lines from 20 to 36 inches; for different groups of pre ir 
Context was shifted in opposite directions. The results as t X 
Judgments of a tracer stimulus Cof 20 inches) in terms of the ars wp e 
Were strongly influenced by the preshift context and shifted considerably as 
the context was shifted. in either direction. Judgments of io m 
in inches were less influenced by the preshift ping and t iis sma effect 
Was reversed after the shift. Apparently, the brief la el Sr oni has 
à greater effect on the temporary rating scale than on the thoroughly practiced 


Scale of inches. 


ed to get shifts in judgment of 


everal other systems of response categories have been tried out and found 
to be vulnerable to shifts from one series of stimuli to another, some, of 
os hers (Fillenbaum, 1961; Harvey and 


Course, bei han ot 
Campbell, 1963, R egies Tolle, 1968b; White, 1964). There are no 
absolute scales of dget Any correspondence between judgment cate- 


Bories and stimuli that can be Jearned can be relearned, and the same 
Conditions that influence the 0 riginal learning presumably influence the re- 
d 
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learning. Thus the safest generalization at present is that temporary sen 
learned in the laboratory are more easily shifted than thoroughly learn 
scales, such as the standard scales of inches, ounces, and degrees. In wel 
ticular case, the amount of shift will be moderated by internal anchors, 
imaginary anchors, and other effects mentioned above. o 15: 
Although the consequences of shifting the series of stimuli ive a 
analyzed in some detail, little work has been done on shifting the al 
response categories in which the judgments are expressed. If it is true = 
the learning consists mostly of grasping the series of stimuli, a switch i = 
alternative response system would go smoothly. That is, if a subject = ; 
to judge a series of stimuli on a scale of 1 to 5, he should be able to ju a 
it on a scale of 1 to 9, or —3 to +3, or by the method of magnitude “ee 
tion with little difficulty. Apparently such shifts have not been studied ; on 
Weiss and Ten Eyck (1960) demonstrated that numcrical and onto 
sponse systems were indistinguishable and that Cin its influence on the 


è f as not 
use of a five-category scale) experience with i 


a three-category scale w 
different from experience w 


ith a seven-category scale, 
INDIVIDUAL DIFFERENCES 


iness 
Many investigators have commented on individual differences in ea 
to shift, and some have attempted to relate these to other individual owt 
ences. The measures may not have the reliability necessary for ont 
between scores for individuals, b i nee 
case, the judgment of pitch, in which subjects with considerable yet 
can be separated from other subjects, and here it is clear that ee abs 
trained subjects are not as susceptible to shifts of context as untrained 


jects (Campbell, Lewis, and Hunt, 1958; Schneider, 1963). 


but groups can be compared. The 
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The purest context and an 
along one dimension 
condition is convenie 
Special case that sugg 
havior. Recent modifi 
questions and have t 
judge may direct his 
to the Neglect of ot 

e context of judg 
of attention and 


} “aao f 
S. televance interact, to be sure; the judge is likely 

to stimuli that are rel 

attend to will appear 


AN EDA o clari 
b imi televant, But it is possible to achieve some sai 
i » A z 1 
Agee the €xperimenter’s operations. Without varying the s 
subject's attention can } 


. y 
chor effects are obtained when all n 
and all are judged in the same way. Thiig S very 
nt for many experimental purposes, but it rdge’s be 
ests a spuriously passive description of the a cresting 
cations of the standard condition have raised —, á 
urned up new information on the judging proces "some 
attention evenly to all stimuli, or he may age ae to 
hers. He may group all stimuli together as Te 


tors 
n ese fac 
ment, or he may exclude some as irrelevant. Th atten 


cted 1o 
fication 
li, the 


ATTENTION AND RELEVANCE * 415 


be judged while others are not. Without varying the instructions, the rele- 
vance of the stimuli can be manipulated by stimulus arrangements. 


TASK INSTRUCTIONS 

It is reasonable to suppose that the attention directed to stimuli that are 
judged is different from the attention directed to standard and anchor stimuli 
not judged. This question arose early, since quantitative models of compara- 
tive judgment have to make some assumption about the relative weight of 
stimuli and standard. Data on comparative judgments have been matched by 
assigning the standard stimulus one-third to one-half the weight of the 
variable stimuli that are judged (see page 397). The data for remote anchors 
are not so clear, and, of course, such estimates of relative weights cannot be 
detached from the other assumptions of the model. It is evident, however, from 
more direct comparisons, that an anchor stimulus that the subjects are in- 
structed to judge has more influence on the organization of a scale of judg- 
ment than one they perceive but do not judge (Brown, 1953; Magaro, 1966). 

Task instructions also influence the learning of a distribution of objects of 
judgments, especially the differentiation between incidental and intentional 
learning. An experiment (Parducci, 1956b) on this point mentioned on 
Page 412 involved judgments of dot-filled squares varying in size of squares 
and number of dots, Instructions from the experimenter directed attention to 
one of these dimensions and whatever was learned about the other was 
learned incidentally. The subjects did learn something of the nature of the 
distribution of the objects on the incidentally observed dimension but, as ex- 
pected, not as much as on the intentionally judged dimension. 


STIMULUS RELATIONS 

An early discussion of affective judgments by Beebe-Center (1932) te- 
stricted what is here called context effect to those stimuli that constitute a 
unitary temporal group, and subsequent investigations have tried to make 
this phrase more explicit for both context and anchor effects in perceptual 
judgment. It is not surprising that two important factors are factors of per 
ceptual organization that have been emphasized by Gestalt psychologists: 
distance and similarity. 
As to the importance of distance, the section on external anchors noted that 
when anchors are too remote, their influence as anchors decreases (see Fig. 
2.13). Extremely remote anchors apparently are not grouped with, or per- 

ceived as part of, the series stimuli at all. ay 
As to the importance of similarity, several experimenters have observed t A 
during an experiment on weight judgments the subject may pick up a Ta 
°t move his chair without noticeable influence on his judgment of the ex: 
Perimental weights. Brown (1953) checked on this matter of similarity by 
aving his subjects lift metal trays as well as the usual laboratory weights. 
© metal trays did have an anchoring effect, but it was smaller than the 
anchoring effect of weights homogeneous with the stimulus series. Likewise, 
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Parducci (1954) found that anchor stimuli of the same color as the series 
stimuli had more influence on size judgments than anchor stimuli of a dif- 
ferent color. Bevan and Pritchard (1963b) introduced several more variations 
in a study of shape judgments. When black rectangles were judged, the 
usual anchor effects appeared if black rectangles of approximately the same 
size were used as anchors. But no anchor effects appeared from circular fig- 
ures, ellipses, rectangles much larger or smaller, or rectangles that differed 
considerably in lightness. Then Tresselt (1965) raised the question whether 
similarity of the stimuli or similarity of the task is more influential. Before 
judging a standard series of l f 
preshift experience with five h 
and with five books of the 


aboratory weights, four groups had different 
eavier laboratory weights identified by letters 
same weight. The effect on postshift judgment 
of the standard weights was greater for preshift experience with the weights 
than the books, but the difference between preshift experience in alpha- 
betizing and in ranking was negligible. All in all, these results justify the 
generalization that the most effective anchor stimuli are those that are similar 
to the series stimuli except for variation along the stimulus dimension. De- 
eect from this ideal condition reduces and perhaps eliminates the anchor 
effect, 


. . . + . i stigi i p 
Another approach to the question of similarity is through the investigatio 


A “ly 
of intermodal anchor effects, Judgments of temporal intervals are especially 
convenient for this 


; Purpose because the intervals can be marked off by 
stimuli of diferent modalities. Intermodal anchor effects have been reporte 
for auditory and visual intervals, but they are smaller than intramodal 
effects (Behar and Bevan, 1961). When auditory intensities are subjectively 
equated to visual intensities, preliminary experience with auditory anchoring 
intensities, either higher or lower, has substantial effects on the equation 
(Smith and Hardy, 1961); but in this kind of judgment also the intermoc@ 
anchor effects are smaller than the intramodal (Bevan and Pritchard, 1964). 
And these intermodal effects are extended by the possibility of intermanu? 
effects, for Dinnerstein (1965) has reported that lifting an anchor weig i 
with the left hand influences judgment of weights lifted with the right han. 


3 A ne 
Tf Gestalt factors of distance and similarity influence anchor effects, © 


might expect that the background of the stimuli would also be important. s 
background of the se 


ore ries stimuli is a factor in judgments of the hay 
quares (Engel and Parducci, 1961), and the background of the ancho ffect 
Sments of lightness (Bevan and Turner, 1965). But the ¢ dis- 
S not automatic; it is influenced, like other effects “a 
Variations in instructions. When an effort was made to nie ; 
Oring of a series of 16 papers graded from white to = , 
dium gray background, to be judged on a scale af : t, but 
choring appeared, since white and black are salient, 


the backgro i eT 
und had no i i rere given that t 

ground Fanna effect. When instructions w ere gi btained. The 

0. 9, however, clear evidence of anchoring was 0 2.51 

standard devia 


tion of the judgments of Stimulus No. 8 decreased POT O: 
-9 From 2.26 to -23, and No. 10 from 2.74 to .67 (Johnson, 
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STIMULUS COVARIATIONS 

When squares varying in size contain dots varying in number, attention 
can be directed by instructions to either dimension. In the experiment by 
Parducci (1956b). mentioned above, the variations in the two dimensions 
were unrelated: but Rambo (1962) arranged stimulus cards on which these 
Variations were positively correlated, negatively correlated, and unrelated. On 
the preshift task, a control group was instructed to judge the size of the 
Squares; other groups were instructed to judge the numerousness of the dots. 
The postshift task consisted of judgment of size of a smaller series of squares 
without dots. The largest effect on postshift judgments followed preshift ex- 
Perience in judging the size dimension, of course. But preshift experience in 
judging numerousness (both positively and negatively correlated with size) 
also influenced postshift judgment, more so than preshift experience with un- 


correlated dimensions, Rambo’s interpretation, which was phrased in terms 


of mediated associations and response competition, stressed the consistency of 


stimulus covariation, whether positive or negative. - 
hen the stimulus materials become complicated, or multidimensional, 
the subject’s attention can vary, not only from maximal to minimal or from 
intentional to incidental, but by intermediate degrees. So Winters arranged 
4 similar experiment (Winters and Johnson, 1969) with the hypothesis that 
the effect on postshift judgments of size would depend on the amount of 
attention directed toward size in the preshift task. Squares varying ans126;/6On= 
taining dots varying in number and brightness, were presented in the pre- 
Shift task; the pos shift task consisted of judging square size. Judging square 
Size in the preshift task had a large effect on the judgment of square size in 
the postshift task, as would be predicted by any hypothesis; but the effect of 
Judgment of dot numerousness depended on the correlation in the stimulus 
Materials, When the two dimensions were not correlated, preshift experience 
in judging dots had a large effect, presumably because the Se had to 
attend to square size noigh to discount that dimension and judge coreg 
Ness accu rately, But when these two dimensions were Lega, corre ated, the 
“ect was smaller presumably because the task required little attention to the 
redundant dimension of size. The effect of ed ae Diy 
the bri i gligible, as precictec, ask di 
Gt sage any een eae e . 
_.n these experiments, all dimensions of the stimulus one yt visible; the 
ifference in effects on postshift judgment were produced by variations in 
the instructions for the task and in the relations between the dimensions. 
though alternative interpretations are possible, the attention hypothesis is 
Parsimonious if we merely assume that the subject attends directly to the 


'mension he is told to judge while attending to Dooe dimensions in 
: task. 
Proportion to their aid in performance of the central 


n general f evi 
al, £ ray of € a aai fs 
» from the array te indirectly, can influence con- 


Variab] i ; 
es that i perception even q í 
influence 3 h experiments offer good techniques for 
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in affective 
studying such indirect perceptual effects; but effects pee pm i of 
and abstract judgments as well can be treated as due to g 
j ` . anai into 
PE ta of intermodal variations and stimulus payee 
experiments on context and anchor effects brings us back to ce n> hag draid 
sional judgments of the preceding chapter. Judgments ong oe ee ie 
mensional often are multidimensional. The methodological = a veré Ji 
in the research summarized in the preceding chapter, the RER aidi- 
structed to judge the composite; while in judgments that are = accel 
mensional, the subjects are instructed to judge one dimension—t nee pee 
the other dimensions are incidental or indirect. Nevertheless, we s rA ae 
the most serious attempt to account for these indirect effects, poppet 
tion of pooling, has some similarities to the adding and averaging pe eh 
of Chapter 8. These two traditions will probably merge in due 


Contrast and Assimilation 


The effect of an external anchor may be called a contrast eon series 
end stimulus that is judged very large in the context of the i stimulus: 
may be judged moderately large in contrast to a larger remote civ aaah o! 
Judgments of the stimuli at the anchored end are most strong Pt is mani- 
course (see Figs, 9.12 and 9.14), but there is a general effect th the mean 
fested by the mean judgment of all series stimuli. Within gh effect 
judgment varies inversely with the value of the anchor Ha aca tabulate 
also appears in frequency rally used 
without regard to the stimu Jaced by 
for the end stimuli, a trate 
the external anchor. struc” 
when the end of the 


also when the 
tions (Rogers, 194]; Sherif, Taub, and Hovland, 1958) and 1 instructions 
anchor stimulus is included for judgment without specia 
(Parducci, 1954; Postman and Miller, 1945), 


; m ishe! 
The contrast effect, by one name or another, is well establis! 


imilation & 
Taub, and Hovland (1958) also reported evidence for an assimila! ements 
which is smaller and | 


nstruc” 
they presented an anch tside 
tions to call the anchor 

the regular series of sti 
the median judgment 

anchor. But w 
Series, the me 
without the 

from external 
assimilation 
complex assi 


ce an 


of use of each category of judgmen 
li. The end categories of judgment, aes 
re less frequently used when the scale is dis ae 
The contrast due to anchoring has been an ‘a 
scale is tied to the anchor stimulus by specia 


Adit Sherif, 


ffect, 


ess stable. In an experiment on weight De 
or stimulus before each series stimulus Ki r, ou 
a six. When this anchor was a heavy ies, that is 
muli, it produced the usual contrast e without the 
of the series stimuli was lower than ulus of the 
hen the anchor was the same as the heaviest hat higher than 
dian judgment of the series stimuli was somew ADT e 


‘gate 
nvestigat 
more 


anchor. It is this effect, opposite to the ian z 
anchors, that is called assimilation. These a eee the b- 
in weight judgments with the hope of exp = when the sub 
milation reported in judgments of social objects, 
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ject’s own attitude may act as an anchor. Assimilation in weight judgment: 
is questionable, however; for, when Bravo and Mayzner (19615 aie ie 
experiment, they found the usual contrast effect but no assimilation Parducci 
and Marshall (1962) did find assimilation in weight judgments under an 
tain conditions—for heavy anchors but not for light iene interpreted 
assimilation as an indirect effect of the more familiar contrast, with the 
anchor serving as a standard for a comparative judgment. Bevan and Turner 
(1964) found evidence for assimilation as well as contrast in judgments of 
on enclosed in square frames. The materials and experimental manipula: 
sot Na fry ag, algae hs gd are e a perception, and the 
> a P es rather than to variations 
in the scale of judgment. We must admit that assimilation effects are transi- 
tory phenomena, not well understood at present. These effects are not exactly 
the opposite of contrast effects and certainly they are not as predictable. f 


Theoretical Formulations 


Po gment data have been treated quantitatively from the outset, for de- 

Scriptive purposes at least. But quantitative formulations of more theoretical 

intent have also been elaborated—from different points of view, with varied 

ae and weaknesses. The models outlined below are not alternative 
ormulations of the same data; they overlap only in part. But all attempt to 

APAD some of the results already summarized above in a more empirical 
anner. 


ADAPTATION LEVEL 
ee well-known concept of adaptation level was introduced by Helson 
eal 1948) to account for complex data on color vision and was later ex- 
fact. toa wide range of data on judgment and other topics (1964). In 
ike » an appreciable share of the research reported in this chapter grew out of 
a ii aroused by AL theory. Although adaptation 1s treated as a gen- 
Psychological characteristic, not restricted to sensory adaptation, and in- 


cludes . 4 i i 
ides the pooling of diverse influences on the organism, the term level has 
zero point or origin to which gradients 


udgment are treated as funda- 
he level to which the organism 
d in reference to this 


A ye meaning. It represents the 
ene, eae are referred. Thus scales of j 
is ada oe the neutral category being t 
es pted and stimuli on either side being evaluate 


NEUTRAL POINTS f 
hen several stimuli, X;, are judged one at a time, the adaptation level, or 
f the stimulus series, according to 


ag » approaches the geometric mean © 
aptation-level theory. 


logCAL + .75d) = logXi 
n 
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The geometric mean is used rather than the arithmetic because Helson, ~ 
lowing Fechner, assumes a curvilinear function relating stimulus sage 
and magnitude of effect. The second term, .75d, which is introduced to - ` 
account of spacing of stimuli and order effects, varies with the stimulu 
dimension and the experimental conditions. he 
For the simple case of two-category judgments, the limen herween f i 
categories, the value of the stimulus that is judged equally often in a 
categories, is taken as a measure of the neutral point to which the organi) i 
is adapted. The theory predicts that this value will coincide with the adapta 
tion level computed as above from the stimulus series. PEE 
When judgments are made on a rating scale of several categories, sat 
middle or medium category is taken as the neutral region, and stimuli b : 
are rated in this category are presumed to be at the level to which the — 
ism is adapted. Thus, if the rating scale extends from 1 to 9, the values € 


> adaj l i : » expected 
the stimuli that get ratings of 5 are the obtained values which are expec 
to coincide with the theoretical AL. 


SCALE VALUES OF STIMULI imulus 
ccording to the theory of adaptation level, the judgment of each oo 
is a function of its distance from the AL, which is more conveniently sy A 
bolized by A in some expressions. This distance in physical units is Xi pr 
but the psychological unit is the just noticeable difference, the size of w ee 
depends on both the stimulus being judged and the adaptation level. Her 
the judgment, J, is predicted by the followi 
J= K(X; — A) 
Rea 


ng equation : 


, g jor ii ressed 
Kisa constant, related to the Weber constant. The judgment, J, 18 eP this 
on a bipolar scale, such as =2) =I, (0, S a hence, according t° cra 
f : G , pera 
equation, when the stimulus coincides with the adaptation level, the nu" 


5 =" imulus 
tor will be zero and the judgment will be zero, or neutral. When the stim 


. j ‘ than 
is lower or higher than the AL, the judgment will be lower or higher t% 


2 : notice 
r because the magnitude of the just his 


. 7 and 
4, 5. The theoretical AL is estimated lure 
udgment data by a least-squares proce¢ a in the 
: Pi . ip 
five weights on a scale of 1 to 9 is show 


-æ data 
ken from Helson (1964). The AL for ileak 


APPLICATIONS 


; ur 
aptation-leye] theory is Particularly appropriate to the quantitative on 
lation of context effects, and many of the phenomena described above þe 
been interpreted by AL th gani iag 
comes adapted mp: s is 
of the effects o pegs 


cory under the assumption that the or, 
to a level or point on the stimulus dimension ai e 
f the Separate stimuli. Since this pooling of stimulus 
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represented mathematically by the averaging of stimulus values, the quantita- 
tive formulation has been applied without special difficulty to the results of 
experiments on judgments of skewed distributions of stimuli also. 

If the AL is represented by an average, the stimulus values that are pooled 
can be differentially weighted on the assumption that some contribute more 
strongly to the adaptation than others. In the early work on judgments of 
visual stimuli, for example, the background was given three times as much 
weight as the figure, or focal stimulus. Likewise, when AL theory was applied 
to experiments with external anchors, anchor stimuli were included in the 
averaging, but with different weight from the series stimuli, and thus the 
theory accounted for the usual contrast effects without additional assump- 
tions (see Fig. 9.12). Whenever the series stimuli, or background stimuli, or 
any other stimuli that may affect the judgments even indirectly, can be 
given quantitative values, they can be pooled in a weighted average, and AL 
theory has been applied to a variety of such stimulus arrangements (Helson, 


1964). 
One of the strongest argument 
Tecent experiment by Helson an 


s for the pooling assumption comes from a 


d Kozaki (1968), in which the scale for 


judging size of squares was influenced by the differential e oh eposie 
to large and small squares. The stimuli were seven aas mar’ a ile 
on a side. For one group these were all exposed for .5 bps for a hid 
: to one second; 

ese Abr aa a on ls although the average of all 
group tl rere exposed longer, s geok 
8roup the smaller squares were exp S 

that duration 
three exposure conditions was the same. The ee re ei 
Pooled with size: that is, long exposure pi A Teal compared to the 

z all sti 

„Pward and thus reduced the jupan a of the small squares pulled 
Standard constant exposure; and long exposure ts of all stimuli compared 
the AL downward and thus increased the judgmen 


i hree conditions were 
to t me judgments 1n the t : 
he standard condition. The judg Si, hence the noeligg deni 


ee separated ee SO o he entire scale of judgment. 
appli j in stimuli but to the 

EP ted, not fust tò certain stimuli bu ith large and small anchors, exposed 
fi Furthermore, a second experiment W, ling assumption Long dura- 

. ooli . 

or short and long durations, confirmed the p S ewer judgments, while 
rion of the large anchor resulted in higher Al: AL and higher judgments 
ong duration of the small anchor resulted in lower g g 


fo) the seri A ” 
ries stimuli. tivation, performance 
ie theory has been applied by Helon eed the tem isk _ used by 
anı other to 5 £ this book, 3 
opics beyond the scope © t or frame of refer- 
others in a ig separ way as a syn ae 
ence, with no testable quantitative 1 


onym for 
mplications- 


CTS 
PRESENT STATUS AND FUTURE pore E judgment, the agreement 
n quantitative applications to PSY F 
letec D AL theory and obtained data om — bern 
impressive (Helson, 1964, Chapter 4- Inevitably, in two decades, criticisms 
, ? i tations. 
ave also arisen, directed at both data and interpre 
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‘ > ifferent 
Several investigators who have tried to apply AL tea epig ee Be 
conditions have not been able to fit the obtained a) 4 a eee 
tween a wide and a narrow range of stimuli are not matche y a ber aie 
tions (Harvey and Campbell, 1963). AL theory does not ne see. 
weights of standards varied in frequency of pr 4003 i am doa, 
Attempts by Parducci and others (Parducci, Calfee, Mars bal : sae nee 
1960; Parducci and Marshall, 1961) to apply AL theory to op = oe dif 
tions of different shapes uncovered discrepancies that led Parducc 
ferent model, to be outlined below. wie ansiitayeion 
Much of the difficulty with AL computations comes from the ass 
of a logarithmic relation between physical m 
leads to the use of the geometric mean, wi 


agnitudes and scale aes Aa 
th adjustments, for the es 
assumption of a linear function and the consequent use of ee cil 
mean yields better fit in some cases, as in length judgments ( ae of the 
Watson, 1962), and Stevens (1958) has supplied many eS may 
superiority of power functions. The demonstration that another ees ae ihe 
e more appropriate than the geometric mean does not by itself eee out, but 
concept of an adaptation level, as Helson (1964, p- 60) has pointec 
it does complicate quantitative f an external 
Another modification in AL theory, as applied to the effects gr poaa page 
anchor, will be required to account for data reported by Sarris he anchtt 
406). The present AL formul aN 
stimulus and thus accounts only 
but the recent data require that 
distance from the series stimuli. 
is another complication, 


tests of the predictions. 


ation gives constant weight to E Fig. 9-134 
for small portions of the curves A = ith 
the weight of the anchor be red i put it 

i Sn T this, 
There is nothing illogical about 


„n criti- 
: x à i as been 5 
S to interpretation, the first part of the term, adaptation, ha used this 
cized as misleading, but many scientists, since Darwin's time, hav 


: . + js not alw 
in a broad sense. The adaptation of adaptation-level theory is n 
Sensory adaptation, to be sure, but t 


logical terms, such as perception, 
objection applies to the second pari 
of the stimulus series ca 
scale by 


ays 


nho- 
x psych 
his term is no vaguer than most T as 

; ; A more 
learning, and judgment. A n ntral value 
t, level, because, although a ce 


esponse 
: a e resp 
n always be aligned with the center of th tistic has 
K y . is sta 
least-squares procedures, there is no evidence that this $ f adapt” 
any psychological significance. On 


€ might suppose from aceon caine an 

tion level that the AL could act as an internal anchor to reduce es pas 
errors of judgment but, as Rambo (1964) has pointed out, AL “i mean 
no place for natural internal anchors. This theory is concerned w 
judgments or scale values, not with variability of judgment. in the spirit ° 

At best, Psychological theories are not very satisfactory. If, ae with 
this chapter, we put adaptation-level theory in the same context ress 
quantitative Psychological theories, its accomplishments look as S has no 
the accomplishments of the others. It is true, as Stevens piers ted ad hor 
that the AL formulation contains several constants to be ae of the 
and consequently that a decisive test is rather elusive, but this "i When # 
theories of the preceding chapter and most other theories as well. 


ive a$ 
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theory, like an automobile, starts to break down, it is seldom immediately dis- 
carded. More often, it is repaired and continued in operation; at worst, parts 
of it are salvaged and reused. 

The most salvageable contribution then, one which has contributed to a 
reorientation in psychophysics, is the idea of pooling, the idea that the effects 
of various classes of stimuli—focal, background, series, anchor, and inci- 
dental—are somehow pooled, or combined, to determine judgment of the 
stimuli. It is no longer necessary to treat judgments of single stimuli isolated 
from other stimuli. The future will probably see some integration between 
the combinatory models of the previous chapter and the pooling feature of 


the AL model. 


LINEAR CORRELATION AND REGRESSION 


The complexities of judgment data documented here suggest separating the 
s about the shape of the receptor 


study of scales of judgment from question: 
function. The most efficient designs for determining psychophysical functions 
are not the most efficient for the study of context effects, central tendency, 


anchoring, multidimensional combinations, etc. In the interests of clarity, 
chosen stimulus materials that yield 


stimuli so as to achieve linearity. 
dard condition, it is possible to vary 


Some investigators have deliberately 
linear data and others have spaced the 
Thus, once linearity is achieved in a stan 
the conditions and study judgment effects in a relatively pure form. This 
two-step procedure illustrates the advantages of experimental standardiza- 
tion as compared to statistical adjustment and, at the same time, permits the 
use of familiar linear correlations. 


From the many experiments on context effects, it is reasonable to assume 


that when the subject is judging a single stimulus, he is simultaneously cor- 
relating the whole scale of judgment with the whole series of stimuli. He 
knows the scale he has been instructed to use (e.g., the numbers 1 to 7), 
has been instructed to attend to (e.g., length). He is 
environment, that the stimulus series is 
range. Hence his task can be treated as the 
yn segment of a number system to a 


Partially known segment of a stimulus dimension. This assumption leads to 
the use of the consenton correlation coefhcient to measure the overall 
ay of the judgments and de conventional regression equation to esti- 

ate the scale v stimuli. : ù 

When pa las peip judged in ordered categories, pi men can 
© arranged as a rectangular table of cag e ata a rich a 
Convenient for correlation analysis. But when the corre aa coefficient is 
computed to get a regression equation for estimation of z scale pat of the 
Stimuli, it is obvious that the scale values depend on the magnitu e of the 
Correlation coefficient. When the correlation appar rme judgments 
and stimuli are closely matched, with little variability. High ratings go with 
large stimuli and low ratings with small stimuli. But when the correlation 
ìs low and the regression line approac 


and the dimension he 
told, or he infers from the laboratory 


li P 
imited to some manageable 
matching or correlating of a knov 


hes the horizontal, large stimuli are 
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Judgments 


FIGURE 9.16. Regression 
of judgments on stimu f data 
two exposure times. eit; 
o points are mean judgms from 
the vertical lines exten@ I 


line is 
g to Qs. The heavy ! 
12 16 20 24 28 z pa regression ay 
imuli From Johnson and Mu 
iii (1969). 
l “ 
snderestimated and small stimuli Overestimated. Thus, treating catego") 
Judgment as a matching task leads to a simple explanation of the centr 


as i ments 
sequence of the variability of the judg 


em hapa to the magnitude of the correlation ea to 
central tendenc be a. model was illustrated with ae Johnso? 
(1952, 1955) a uilford (1954), with fictitious data, and by se. The 
SiTe alti wie category judgments collected for another ne 
inf 5 appear in matching data obtained by adjustment me ex- 
in fact, Hollingworth’s first illustration was taken from an adjustment x 
periment—and Stevens and Greenbaum (1966) have reported several n 
amples of central tendency in the matching of values on one P aep $ 
continuum to those on another. They interpreted the constriction of the 


and is inversely related 
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tained scale values in terms of the regression that occurs “whenever the 
results of the matching judgments yield less than a perfect correlation.” 

Recently, to examine the implications of this approach more thoroughly, 
Johnson and Mullally (1969) held stimulus conditions constant and used expo- 
sure time as an independent variable. They found that a decrease in exposure 
time decreased the accuracy of judgment of each stimulus (slides containing 
green and blue dots, as described on page 402) and therefore the judgment- 
stimulus correlation. Fig. 9.16 shows regression lines computed from the 
judgments at two extreme exposure times: at 1.00 seconds when the correla- 
tion was .75, and at .04 seconds when the correlation was .29. The central 
tendency of judgment that appears in the graphs is also manifested in the 
standard deviation of the scale values which, in units of the 11-point rating 
scale, was 2.27 for the long exposure and .97 for the short exposure. Fig. 
9.17 charts these relations by comparing the two sets of obtained scale values 
With those to be expected in the hypothetical condition of perfect correlation 
between judgments and stimuli. 

The correlation-and-regression model is easily applied to the effects of an 
external anchor if the anchor is judged as the other stimuli are. The data 
graphed in Fig. 9.14 to illustrate anchoring effects are pertinent here: the 
anchor was a salient stimulus that tied down one end of the scale, but in 
each case the judgment scale as a whole was adjusted to the extended stimu- 
lus series as a whole; Hence, the assumption of linearity holds for each 
regression line. The usual contrast effects can be seen, but the conventional 
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‘. iti nal 
regression equations account very well for all scale my Nadie dag ee 
assumptions. The linearity may break down, of course, if t F an 5, but it is 
is extended too far (as noted above in the section on anc oring 2 atag 
possible that judgment of the anchor stimulus reduces this danger by 
the anchor a part of the regular stimulus series. Sitti eae 

This model accounts not only for contrast but also for assimilation rd 
certain conditions. The data graphed in Fig. 9.14 show that in the jp No. 
condition the mean judgment of Stimulus No. 11 was about 2. n nae 
1, a remote anchor, produced the usual contrast effect, such that : a tau ve 
judged about 3. Now Suppose a near anchor had been psig a that en 
No. 10. This would presumably be judged a 1 and would tie down i 2 15. 
of the scale so that, if linearity obtained, No. 11 would be judged a + rep- 
In comparison to the judgment of 2 under standard conditions, th ri e 
resents movement toward the anchor, or assimilation. Cerera lieANan. E less 
illustration leads to the prediction that when judgments are e i 3 
than complete accuracy and central tendency is present, the intro w on 
a near anchor ties down the near end of the scale, reduces central ten ) 
and produces assimilation. . vhenever 

Regression toward the mean is not just a conjecture; it occurs \ he data 
the correlation between two variables is less than unity. But when a ne 
are not suitable for correlation analysis, the regression may be intl 
confounded with ad hoc statistics. Correlation and regression are — a 
to be sure, rather than explanations; the model attributes the me vana 
the correlation and consequently the scale values of the stimuli to th 


iability 
oe F : wn . s variability 
bility of the single judgments. Any condition that influences 
will influence the dependent statistics. 

Once the regression e 
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the scale of judgment i 


: ne 
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å re; 
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gories vanish. 
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toward the mean incr 
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hod 

Regression effects apply also to judgments made by the ranking <a or 

if mean ranks are obtained from repeated rankings by the same eal the 

from single tankings by different judges. Mean ranks regress a cae the 
center of the scale in inverse relation to the average correlation betw 


ranks. The correlation model is presumably applicable to judgme 
unbounded scales, as by f 


next. 


| are 
. > mode 
model and the correlation-and-regression 
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similar in that both involve a least-squares fit to obtained data. The former, 
however, assumes a logarithmic relation between judgments and stimuli, 
while the latter assumes a linear relation. But the distinctive characteristic of 
the correlation approach is the assumption that the statistic does what the 
judge does, namely, correlate judgments with stimuli. AL theory states that 
the stimuli are scaled in relation to psychological distance from the AL. The 
correlation model states that the scale as a whole is adjusted to the stimulus 
series as a whole, even though the judgments are made one by one. 


RANGE-FREQUENCY COMPROMISE 

Parducci’s extensive experiments with many different distributions of 
numerals (Parducci, Calfee, Marshall, and Davidson, 1960) and lengths 
(Parducci and Marshall, 1961) disclosed discrepancies in predictions of the 
central value, or adaptation level, of a scale of judgment from the mean of 
the stimulus distribution. The obtained central values seemed to depend on 
two characteristics of the various distributions: the midpoint and the median. 
According to Parducci, the predictive usefulness of the midpoint comes from 
the judge’s tendency to divide the range of stimuli into two equal subranges; 
and the predictive usefulness of the median comes from his tendency to 
divide the distribution into two equal frequency distributions and thus to use 
both sides of the response scale with equal frequency. The predictive useful- 
ness of the mean, either arithmetic or logarithmic, is an accidental conse- 
quence of the fact that it often falls between the midpoint and the median, 
and for that reason is often a good first approximation. 

The method devised by Parducci (1963, 1965) for applying this model to 
the whole scale and predicting mean judgments for each stimulus makes 
use of the limens between adjacent categories and asserts that each limen 
results from a compromise between two principles. The range principle states 
that the judge divides his psychological range into as many subranges as there 
are categories for him to use. If told to judge in four categories, he would 
divide the range into four equal subranges. The frequency principle states 
that the judge puts the same number of stimuli in each category. If told to 
Judge in four categories, he would divide the stimulus distribution into four 
quarters. When the distribution is not rectangular, these two principles con- 
flict, and the range-frequency model asserts that the judge gives each equal 
Weight. Thus the empirical limens will be means of the corresponding range 


limens and frequency limens. 
Tests of the model begin wi 
tangular distribution of stimuli, 
get empirical limens and theoretic: 
the means of both the frequency a 


th a standardization experiment using a rec- 
9 squares judged on a 6-category scale, to 
al limens. Since the empirical limens are 
nd the range limens, the range limens are 
obtained algebraically from the other two. This procedure gives hypothetical 
range limens that may be outside the actual stimulus series, presumably be- 
Cause the judge’s expectations may not be limited by the stimulus series. 
These range limens are then used for other distributions of the same stimuli— 
Positively skewed, negatively skewed, U-shaped, etc.—but new frequency 
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oe that the new 

limens are computed for each distribution. The model a a aeS; 

= irical limens will be the means of these two sets of theor a from 

e s ie ral 

| the predictions are tested by comparing the le go P stiimilus. Fot 

; i sach : 

the compromise limens with obtained mean judgments o a cand, When 

12 distributions of various shapes, the agreement was qui e a with 

he stimulus series consisted of bursts of random noise (cor d for the 

nt i h frequency model accounte 

positive and negative skew), the range-freq y } eceding chapter 

judgments very well, though the averaging model of the p i len cael 

aud ah intensity-summation model were inadequate (Pard sie oane A 

Anderson, 1968). And this model can be applied also o a 1969). 

of anchor and postanchor effects (Parducci, Perrett, m ai foe te: UR 
a : ” ; el wi 

Because it is a compromise, the range frequency m noromise betien 
: aie: oe: o 

sidered either parsimonious or elegant. But if judges yh es one renee © 
fm : i ' An Í 

these two principles, a compromise model is necessary. Aecualaeteats, Pardo 

account better than any other for the wide variety of oe hich assumes 

. : A . J e b Ww ds 

has Investigated. In contrast to the adaptation-level_ mo Semen model 
i i ged in relation to the > ge- apenas hi 

that each stimulus is judged in rel to the AL, the range f Ekne I This 

; r : : romise 

states that each stimulus is judged in relation to the comp on Javel theory to 

is the reason given by Parducci for the failure of ee Sons, Like the 

account for category judgments of these widely varied distribu 


“ith 
. ad only W 
adaptation-level model, the tange-frequency model is concerne 


acy may 
: accuracy 
mean scale values; it ignores accuracy of judgment, although 
be involved in th 


e values obtained for the hypothetical range ong impli- 
Since compromises are not as exciting as monoideic eS expect 
cations of this model have not been thoroughly explored. One a ia of the 
that variations in instructions could shift the emphasis from one si 
compromise toward the other and yield predictable consequences. 
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There is plenty of evidence j i 
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magnitudes; but, when t 
of weight judgments le 
to a collection of stim 
important assumption is that “the di 


f physic 
or contribute more strongly to discriminati A i 2 
stimulus values [p. 2].” 


difference? 
r 2 . Decode ight di 

where density differences are teadily discriminable, but weig 

are not, density 


ily in the magnitude judgme 
weight. This vari 

cal function for 
by shifts in the 


judgment to the new series. 
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The tests of the model were indirect for the most part, but one that is 
convenient for illustration involved shifts of series with the same weights 
while density was varied. This was done by making some weights of high- 
density plastic and some of low-density tin. All subjects had a standard weight 
of 200 grams, presented once at the beginning, with instructions to call it 100 
and to judge other weights as proportional to this. The preshift series con- 
sisted of five weights from 100 to 300 grams and the postshift series of 
five weights from 700 to 900 grams. Thus the shift in weight for all condi- 
tions was the same, but the mean shift in density, expressed in grams per 
cubic centimeter, varied with the materials: from plastic to tin, —.64; from 
tin to tin, .45; from plastic to plastic, 3. 5; from tin to plastic, 4.82. The 
results clearly demonstrated large separations in the postshift judgments, re- 
lated to the density shifts. 

Unfortunately, all the judgments drifted upward, as if the weights became 
heavier or the subjects became weaker as the experiment continued, even in 
the preshift phase and in control groups that did not shift. One would expect 
that the judgment scales would be symmetrical around 100, but very few 
judgments below 100 were reported. This does not invalidate all the con- 
clusions, but it raises questions about the meaning of magnitude estimates 
for weight and confuses the evaluation of the trends. The meaning of density 
is also unclear, since size was varied as well. A sharper test of the theory can 
be made, the authors tell us, when it is possible to estimate the representation 
of stimuli by individual subjects. Nevertheless, the results of several experi- 
ments—including judgment-stimulus correlations and regression lines— 
though limited to weicht judgments, strengthen the case for a variable-basis 
model and may how the way through the complexities of judgments of 


multidimensional stimuli. 


THEORETICAL DEVELOPMENTS AND 

UNANSWERED QUESTIONS 

To a critical reviewer, the methodologica : 
ments on psychophysical judgment is illusory. With one hand m aa 
menter displays graceful curves that demonstrate how beautifully 7 mag i 
formula comprehends natural phenomena, while his other han surrep 
tiously removes the conditions that would disturb the demonstration. 

The reasons are historical. Psychophysics began by seeing single a 
sions in complex natural objects and then constructing experimental 2 jects 
that were more nearly unidimensional. Though such standardization of stim 
uli has obvious advantages for psychophysics, in the study of eee it is 
an oversimplification that has been at least partly corrected by the recent 
interest in multidimensional psychophysics. 

The fundamental sents about the part and the whole that Gestalt 

5 ‘ i osed once again in the context of 
Psychologists insisted on asking are being pos ye. bem, Ties me 
judgment research, though perhaps in a more researc : e form. eae 
chapters have owt ie flexibility of judgment experiments an eir 


l sophistication of most experi- 
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capacity to pick up subtle effects, Clarification of the nee be A i 
wholes may prove to be one of the main contributions of the Ps) 5) 
judgment to general psychological theory. , E e 
Everyone knows that the unidimensionality of a set of stimu us : J Fe 
largely due to control of the subject’s attention by gece he se a 
experimenter, but this control was long considered a sai ae Eo 
necessary for scientific investigation, Recently, however, the subjec eils; 
tion has been experimentally varied, by variations in the stimulas Ë ps Ta 
and by instructions. Many more such variations can be eee A aait 
future, Perhaps the role of attention in these judgments can be rng a 
directly, by asking the subject to distribute his attention even) in: 
dimensions, for example, or to concentrate on one rather than ine a 
As to the relation between parts and wholes over time, or the negn Te 
of events by learning, the parallel between the judgment E ap 
concept experiment is most instructive. Knowledge of the class is b 
from experience with single inst 
turn depends on knowledge of th 
aligned with the stimulus series 
response to the single stimuli 


scale to the stimulus series. In either case, the subject can pe eee 
Sponses or he can, if asked, talk about the concept or the scale of ne with 
The chief difference is that concept experiments typically are concerned an 
combinations of familiar discrete attributes, while judgment experiments 
concerned with combinations of v. 

e Previous chapter defined 
kind of problem, and the recent 


tractive. It appears that the subje 


do what he think 
tion is available 
salient anchor st 
dimensional an 


re- 


alues along one or more dimensions, special 
judgment, in an abstract WAY); as 4 ee at- 
experiments make this definition i ts, t0 
ct who tries, as in concept jos rare 
s the experimenter wants him to do uses whatever in 
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imuli, and from the instructions. When the stimuli tic 


d the stimulus Series is shifted up or down, the one 
more apparent. In these more aie 
dardized, less oversimplified, more s the 
d one with no clear feedback, hene. of 
is from observation of the consistency 
his own judgments. -e musl 
To the disenchanted Critic, experiments on shift of stimulus a Iving 
be ambiguous to the subject, like experiments on shift of PADET SR 
tty to hold on to the old context, or to shift to Senet 
ue to use the scale Categories or the magnitude ae a 
© compromise, as in many Michigan 
formal class experiments at d high 
J € asked to make category judgments of low E # 
following a shift in a series of auditory frequencies, the usual shift ae 
when the subjects, who were students of erpen? gave 
psychology, were asked to judge the stimuli in cycles per second, they 8 


set. Does the subject 
Does he try to contin 


m 


cale 
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rather realistic judgments. Though their accuracy was not high, their judg- 
ments gave clear evidence that they knew the stimuli had shifted. It might be 
reasonable, when the situation is complex, to speak of judgment strategies 
and to try to identify them. Instructing the subjects to use two different 
response systems may allow them to tell the experimenter more clearly what 
he wants to know; an experiment with a definite criterion of accuracy known 
to the subject might indeed clear up the ambiguities. In any event, the 
problem is solved from the subject's point of view when he makes judgments 
that seem consistent to him. 

From this perspective, the question whether anchoring effects and realign- 
ment of judgment scales involve changes in perception seems less than 
crucial. Recent experiments by Helson and Kozaki (1968) and by Bell 
and Bevan (1968) offer strong evidence of perceptual changes—whether 
related to adaptation level or not. And experiments on abstract judgments 
Cin the section below on attitude items) demonstrate similar effects with no 
special relevance to perception. As in a court of law, where judgments are 
based on both substantive and procedural considerations, laboratory judg- 
ments are based on perceptual variables and judgment scale variables, but 
neither has any more inherent validity or psychological interest than the 
other, f 


Regardless of the procedures and logical implications, there are incon- 


sistencies in the published data that are probably due to differences in stimu- 
lus materials. Some experiments are More conveniently done with auditory 
stimuli and others with visual stimuli. A cautionary note 1s required for 
judgment experiments as for problem-solving experiments and coner pt iex 
Periments: conclusions drawn from an experiment on one kind of stimulus 
material are hazardous. The difference between the judging process on one 
task and another may be only in a parameter, but it may also consist in an 


additional operation. 


Judging Attitude Statements 
and Other Social Objects 


Both affective and abstract judgments are involved in the testing of atti- 


: ces affective judgments when he 
a Ti subject taking ihe m PA th a 35 immoral.” The 
’pproves or disapproves such statements as ir eae : i 

judges who aid in the construction of the scale make abstract judgments 
when they rate such statements on 4 scale of 1 to 11 to indicate the position 


of each on an abstract attitude continuum As attitude statements have been 
of judges, questions have arisen, espe- 


Sealed wi > ets 

with of different S > ; 

cially A a aR of context an of the judge’s own attitude, that are 

relevant for the psychology of thinking in general as well as for psycho- 
* J 5. 

metric methodology. 

n Many influential variables (such a 

Interest the social psychologists construc 


s degree of emotional involvement) that 
ting attitude scales cannot be con- 


a A EAR 
432 % JUDGMENT: RESPONSE VARIABLES AND SCAL 


sidered here—as many variables influencing perceptual pgri ie 
not considered. It is the cognitive variables related to me a 
of judgment that are most relevant to the psychology o “ip sel dow be 
When a judge is asked to rate attitude statements, it is ot ote ih 
has conceptualized the relevant dimension from previous ona nine 
such statements. Presumably, abstract dimensions are learned as . r lahalttni 
cepts are learned. Scaling experiments are not concerned with — seu: vue 
of the attitude dimension except as it affects Variability of Hogamih hjani 
directly with the position of statements on the dimension. Most of ; som for 
has used category scales, but magnitude estimation has also om k h 
scaling social objects (Stevens, 1966b). Both types of scale were ealn we 
in an extensive project on the measurement of delinquency (Se 


Wolfgang, 1964). 


THE RANGE OF OPINION 


ci is pre 
Typically, a series of Statements covering a wide range of opman FE 
sented to the judges and the scale is aligned with the series so that Si ante 
favorable statements get high scale values and extremely unfavora e 
ments low scale values. This correspondence does not necessarily jars le 
the effect of the immediate context on the development of a ot ati 
as in the other examples considered in this chapter. The taupe ia! asi is 
on the issue may be known to the judges in advance, hence if a ees 
presented to them for scaling, the correspondence that results Oy aoitatt 
sequence of considerable experience in the past rather than ad hoc I: thie 
experience, Attitude statements m P erai render 
ment of an extreme opinion often Suggests—to the impartial, informe 
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the response word right. 
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statements th: i 5 n 
nts that are ambiguously written are more vulnerable to various insta- 


bilities (Zimbardo, 1960). 
A study of college students’ attitudes toward fraternities failed to find a 
a ae Kind bet did ce ge effect. ae (1959) 
dlistev vote od gi statements into an un avorable range Cwit previously 
E me 93 scale values of 1 to +) and a favorable range (with scale values 
ices n The unfavorable range was rated 1.2 to 4, and the favorable 
ee al o 6.6, showing considerable resistance to the context effect. How- 
» when the group that rated the favorable statements before the unfavor- 
ie ahs a group that rated them in the reverse 
Sieibieries nip of contrast appeared. Subjects exposed first to favorable 
who u i s judged the unfavorable statements more unfavorably than those 
judged the unfavorable statements first. Conversely, those exposed first 


to fa 2 
unfavourable statements judged the favorable statements more favorably 
statements first. And this contrast effect 


of attitude toward fraternities. 
Iternated once, it can be alternated several 
-measurements design in studying 
and found contrast at first, then 
similar is unclear, as Atkins has pointed out, but 
ie had previously been found in judgments of aer in 
ee escriptions of behavior (Bieri, Orcutt, and Leaman, ), so the 
s are not accidental. Whatever the interpretation, it appears to be 
udgments and a repeated- 
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intext effect of the usual 


able state : 
tatements was compared with 


a exposed to favorable 
If the throughout the whole range 
times, ae can be thus a 
me kins (1966) tried this repeated 
M students’ attitudes toward fraternities 


imilati Th. ¢ . 
ilation. The interpretation 


h social j 


ext effects earlier in this chapter noted that the 
production as well as judgment, and a 

(1966) for attitude statements. 
. ted by different subjects 
cribing Category 11, the 
1, the anti-Negro end. 
it became evident that 


Bs pee of conte 
Similar eff, Dren can influence Ps 
ee ee - Lissa reported pik pp n 
and, after oes statements about T grin yo 
Pro-Neoy ating, each subject wrote a statem 
egro end, and a statement describing Category 


by other judges, 


nen née 6 
these statements were scaled 
dependent on the ranges 


the descri 
: escripti 
judged. ptions of the end of th 


THE 
HE JUDGE'S OWN ATTITUDE 
ves scrutiny in respect to the use of a 


The ; 
eC a 
judge’s own attitude also deser AAE DAN 
e rests ON the assumption that judges 


Sca S 
mea judgment. The scaling techniqu ei give rasjor weigh 
is Pi © to conceive an abstract attitu uum 4 Bia neh ‘iis i 
sy relevant abstract implications of each se tenis about sow sii 
to uires thinking in abstract, impartial quantitative, me ? jona 
ce kg he ing lnt omne 
are . . 1 t7doments S 
supposed to make impartia judg k whether anyone can do this. 


o . 
u $ ie . 
quiet legal categories, and it is not idle to 8 


434% JUDGMENT: RESPONSE VARIABLES AND SCALES 


This is not a question of random error. Scaling projects use many judges 
so that random errors will cancel. The danger is that judges with different 
attitudes will systematically slant the judgments one way or another. The 
tesearch here has been directed toward the practical question of hu 
the assumptions of the scaling technique are violated, and also ocr 
people's reactions to persuasive communications intended to modify thei 
attitudes. 

Hovland, Harvey, and Sherif (1957) applied the assimilation-contrast 
model to this problem by treating the judge's own attitude as an anchor. Ton 
did not measure the anchor effect directly, as by a reduction in variability 0 
judgment. Rather, they assumed that the judge’s own attitude would act ee 
standard and that opinions at other points on the scale would be evaluate 
in reference to this standard. Contrast means that 
attitude far from the judge’s ow 
fore rejected; assimilation means 
even closer. When they had subjects of 
attitude toward alcohol e 
dence for contrast in that the drys 


ere obtained by Manis (1960) in responses T 
ages. The committed subjects at each end i 
nciations as more extreme than did the neutra 
ence for assimilation was slight. The effects a 
are not limited to the extremists, however, me 
dgments of subjects whose own attitudes locate 
them at five positions along the scale (Zavalloni and Cook, 1965). ef 
A later study by Sherif and Hovland (1961) considered the involvement 


of the judgment scale, whi] 
Susceptible to differences 


ee t Z fem- 
» It is not necessarily a matter of judgment. N 


A : hich give 
cific experiences which 8 


TE x ne. 
to favorable, It is worthwhile observing an individual’s response to each ae 
He may divide the Series in two, and accept those at one end while reject! k 
those at the other. More commonly, if the series includes a full range 
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may be called latitudes of acceptance and rejection (Sherif and Hovland, 
1961). Typically, the extreme statements at both ends are rejected by almost 
everyone: the differences are in the range of acceptance between these ex- 
tremes, Fig. 9.18 illustrates ranges of acceptance and rejection of statements 
spaced along a continuum extending from A (extreme Republican partisan- 
ship) to I (extreme Democratic partisanship). The E graph shows the pattern 
of acceptance and rejection for subjects whose own attitude was at E. They 
approved statements at D, E, and F, and rejected the extreme statements 
at both ends. The other graphs show latitudes of acceptance and rejection 
for subjects whose own attitudes were at B and H. There are transition zones 
between ranges of acceptance and rejection, of course, where noncommitment 
or uncertainty is the more common response. The E people, in the middle of 
the road, were most uncertain about statements at C and G. Usually, the 
subject in the middle finds relatively few statements that he can reject; the 
larger number of rejections come from the judges at the extremes. 

Likewise, when statements are judged on a scale of 1 to 11 as to truth or 


falsity, the extreme statements, extremely dry or extremely wet, are rated 


most false. Naturally, the drys rate the extremely wet statements false, but 
5 è false than the moderately 


they also rate the extremely dry statements mor 
dry statements. In the same way, the wets rate the extremely dry statements 


false, but they rate the wet statements more false than the moderately dry 


statements (Sherif, Sherif, and Nebergall, 1965). 

When Segall (1959) asked college students to rate statements about 
fraternities, he did not find a difference due to the students’ own attitudes, 
Possibly because the range of opinion was small. He did find, however, that 
these students—when explicitly asked to judge the statements in relation to 
their own attitudes—were able to use their own positions as standards and 
Sive different scale values as expected. This is another illustration of the 
ability of sophisticated subjects to adopt a flexible approach to the judgment 
task; certainly, the meaning of the results depends on the subjects’ under- 


standing of the task. 
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INTERACTION BETWEEN JUDGE'S ATTITUDE 
AND RANGE OF OPINION 


Because both the range of statements to be judged and the git 
attitude have had significant effects in some experiments, Upshaw Sard 
formulated a model that includes these effects and their interaction. = 
important addition is an interaction hypothesis, which states that the ju a 
attitude is critical when it is outside the range of statements presented. oe cm 
the judge’s position is within the range of statements presented, it is — 
that he adopts a reference scale defined by the statements. But a 7 
position is outside the range, the judge extends the range to include his - 
position and adjusts his reference scale to this wider range. A meager i 
of this extension is that the middle of the scale moves in the direction of tie 
judge’s position and produces a contrast effect, like the effect of a me 
anchor, so that a statement judged a neutral six by in-range judges would ta 
judged more favorable, perhaps seven, by the judges outside the range aii o 
unfavorable side and less favorable, perhaps five, by judges outside the rang 
on the favorable side. Another consequence of the extension of the scale s 
that out-of-range judges necessarily use a scale of wider units than in-range 
judges. ; 
To test this model, a series of statements covering a wide range of ane 
toward the Negro (one without the extremely anti statements, and one coed 
out the extremely pro statements) were given to anti, neutral, and pro ee = 
for rating. The results demonstrated that the judge’s own position is ep 
cially important when it is outside the range of statements presented and tha 
in this condition the judge extends the scale to include this position. is 

Indirect evidence for assimilation can be uncovered in Upshaw’s 
(Manis, 1964). But the assimilation hypothesis requires that scale aliea 
statements near the judge’s position be displaced toward his position, and t T- 
implies some departure from linearity in the relation between scale values ‘ed 
tained from judges of different attitudes, Upshaw’s (1964) reanalysis yielded 
linear correlation coefficients near unity, which appears to be direct evidence 
against any serious displacement of scale values. Other recent investigations 
have reported very high correlations between scale values obtained ei 
groups of different attitudes COstrum, 1966) and by different scaling tec” 
niques (Hicks and Campbell, 1965). The importance of the judge’s own an 
tude in determining a range of acceptance and ranges of uncertainty and - 
locating the statements that will be rejected is clear; but the evidence for 3 
shift in scale values due to assimilation is disputable. i 

Departing somewhat from the method of judgment in given categories, 
Sherif (1963) allowed her subjects to choose their own categories to indicate 
ranges of acceptance and rejection. White and Indian high-school students 
considering the purchase of a warm coat simply sorted price tags—from = 
to $54 in one series and from $5 to $104 in another—to designate those a 
were too cheap to consider, within a tange of acceptable values, and po 
tive in price. The context effect showed up as usual in that the rang 
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deemed acceptable varied with the values presented. For the white students, 
the upper boundary of the acceptable range was $49 for the short series and 
$80 for the long series. But the context was not the only determinant of 
the categorizations. These boundaries for Navaho Indian students, accustomed 
to lower living standards, were $38 and $70. Although the effect of the 
attitudes of the two groups was not as large as the context effect, it was 
clearly significant. And this must be called an effect of their own attitudes 
because a parallel experiment with the same numerals but with no reference 
to price tags disclosed the same context effect but no difference between 
whites and Indians. 


ATTITUDE METHODOLOGY AND JUDGING OPERATIONS 


From the standpoint of the psychology of judgment, this discussion exhibits 
the generality of certain principles of judgment—they apply to categories 
spaced along highly abstract social dimensions as well as more concrete per- 
ceptual dimensions—and clarifies some limitations. From the standpoint of 
social psychology, these are methodological considerations. The social psy- 
chologist has to know the consequences of variations in judging operations 
in order to control for them and to obtain evidence of genuine attitude 
differences and changes. Despite the inconsistencies, a few useful generaliza- 
tions can now be gleaned from the research reviewed in this section. 

1. The test constructor, who wants to get scale values of attitude statements 
and assemble a test for use in attitude research, can stay out of trouble unless 
he is looking for it. The differences between judges are of two types: differ- 
ences in level or reference scale origin, and in dispersion or scale unit. But 
absolute scale values are not necessary (Upshaw, 1965). If the test con- 
structor avoids extreme judges, discards ambiguous items, and, in general, 
follows good psychometric methodology, the scale values derived from one 
Stoup of judges by one procedure will be linear with scale values derived from 
others, 

2. Context effects sometimes do occur in the production and judgment of 
attitude items, but they are not as large or as consistent as in judgment of 
Physical objects. Most demonstrations of context effects consist only of differ- 
ences in scale values of a few statements when judged in different attitude 
Tanges, 

a This does not imply that the context is unimportant but, more likely, that 
the important context is the range of opinion previously encountered in 
mass media, in reference groups, and in one’s own direct experience. 
gainst these familiar and easily remembered ranges, the range of statements 
Presented in the laboratory can yield only a recency effect. Thus Segall 
(1959) did not find the conventional context effect from judgments of 


two ranges but did find a recency effect due to the order in which the 
‘wo ranges were presented. As Sherif, Sherif, and Nebergall (1965) have 
Pointed out, shifts of this kind can easily be confused with true attitude 
Changes, 

Studies of context effects on the judgment of opinion statements have not 
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applied the quantitative models described earlier in this chapter. Hence 
whatever context effects occur can equally well be interpreted in terms of 
adaptation level, linear correlation and regression, range-frequency compro 
mise, variable perspective, or some other model. The special logarithmic fea- 
ture of the adaptation-level model has not been used when the term has been 
applied to social judgments. Distribution effects, which the range-frequency 
compromise handles well, have not been studied, and might not show up 
because of the variable frequencies of previous exposure. The linear correla- 
tion-and-regression model has the advantages of simplicity and wide applica- 
bility, although correlation coefficients should perhaps be calculated for 
individual judges because subtle effects, such as assimilation, obscured in 
group data, might be manifest in individual data. 

3. The judge's own position on the scale may not have much practical 
significance for test construction, but his division of the scale into subranges 
or latitudes of acceptance and rejection, which is dependent on his own 
attitude, is an important part of the judging process and must be included in 
any comprehensive theory. A statement rejected by one judge for one reason 
may be rejected by another judge for another reason, just as a middle-of-the- 
road candidate may be rejected by some as too liberal and by others as too 
conservative. 

These latitudes of acceptance and rejection have another significance 
beyond dispute. It is at the boundary between one range and another that 
uncertainty reaches a maximum. The effect of distance from a category 
boundary on confidence and speed of judgment will be discussed later; state- 
ments near a boundary are also particularly susceptible to change from Te 
jection to acceptance, and the reverse, i 

4 Anchoring effects are frequently reported for end stimuli, and occ 
sionally for middle stimuli. f sa 
reduction in variability and error of judgment. Though anchoring effects 
aes nega in many discussions of social judgment, data on variability 
von pre Some heres the no sf SES 
dence that these on hav ad le, bet no one has deme es era 

ave special anchoring effects beyond their gen®™ 


. . . . i 2 
contribution to the context. Nor is the judge’s own position an anchor in thi 
respect, as far as we know. 


The best measure of such anchoring effects 1 


Accuracy of Judgment 
The preceding cha 
accuracy of judgmen 
been concerned 


Pter considered the influence of stimulus variables A 
ee Much of the research reviewed in this chapte" n 
with principles of judgment that influence stability of pea ~ 
values over variations in conditions and judges rather than with accurac} 
per se, although the correlation-and-regression approach is an exception in 
treating scales values as dependent on accuracy onan for its ow? sake 
can be estimated by measures of association Rakes judgments and stimu! 
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such as the correlation coefficient, the contingency coefficient, and amount of 
information transmitted or uncertainty reduced—as well as by direct counts of 
errors, when the data permit. 

Many people complain, when asked to commit themselves to precise 
judgments of complex objects or events, that the task is “too subjective.” 
It is subjective, of course, just as the assertion that the sky is blue is subjective. 
And it is true that many human judgments are full of error. But it is also 
true that accurate judgments of complex materials, in the sense of agreement 
between independent judges, can be obtained under good conditions, as in 
some of the investigations described in previous chapters Ce.g., Johnson, 
Parrott, and Stratton, 1968). Under good conditions, judgments of students’ 
English compositions are accurate also, in the sense of correlation with more 
objective scores (Godshalk, Swineford, and Coffman, 1966). The emphasis 
in this chapter on variability and error tendencies should not obscure the 
fact that high accuracy can also be obtained with reasonable effort. 


NUMBER OF CATEGORIES 

Foods, personality traits, and other complex objects of judgment are often 
rated, for research or practical purposes, on scales of 5 or 7 categories, but 
what is the optimal number? In this situation psychologists seem to admire 
odd numbers, especially that enchanting number 7, but it is safe to say that 
most ratings scales have been too short. Guilford’s (1954) broad discussion 
of rating scales, based mostly on correlation studies, came to this con- 
clusion. Several experiments (Bendig, 1954; Eriksen and Hake, 1955; Gar- 
ner, 1960) have shown that more information is transmitted by longer rating 
scales, even up to 20 or more categories. In general, the response scale should 
allow the judge to communicate all the discriminability he is capable of, al- 
though this recommendation may include some circularity because the dis- 
criminability is not independent of the method of measuring it. Differences 
reported in published research, though small, favor the longer scales. 

Judges often complain that they cannot make the fine distinctions re- 
quired by scales of many categories, but this complaint reflects the subjective 
difficulty and doubt associated with judgments near category boundaries 
tather than objective data. One can argue in the other direction that the 
scale should be chosen so that all judgments are difficult, since otherwise the 
Judge's ability will not be used to the best possible extent. l 

It is possible that unbounded scales of unspecified categories will turn 
Sut to be either more or less accurate than the more conventional scales of 
fixed categories, but there seems to be no evidence on this point as yet. 


ANCHORS AND ACCURACY 
ihe common practice of defining the 
th called anchoring the scale, and it doe 
© differences are small (Bendig, 195% 


Cent 
intr y, somewhat different results were Te 
oduced a variation in weight judgments by 


ends of a scale by short phrases may 
s seem to increase accuracy, though 
Bendig and Hughes, 1953). Re- 
ported by Noizet (1967), who 
having one group of subjects 
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sort the weights into unlabeled boxes. With 5 weights and 5 boxes this re- 
sponse system vielded better accuracy Ccounting systematic and variable 
errors) than the conventional verbal system, very light, light, medium, heavy, 
very heavy, used by another group; and the author suggested that the verbal 
labels may have produced asymmetrical anchoring. The sorting responses were 
apparently more difficult to learn than the familiar verbal system, however, 
because 9 out of 42 subjects switched boxes during the experiment and had 
to be dropped from the analysis. 

Anchoring stimuli can also be spaced along the stimulus dimension to im- 
prove judgment. Wegryn (1964) selected 5 salient colors out of a series of 
25 by a standardization experiment and then had another group of judges 
use these as reference points for category judgments. With these salient 
anchors strategically located, the second group was able to transmit 4.4 bits of 
information, much more than usual with unanchored scales. (The effects on 
accuracy of natural anchors—such as the vertical and the horizontal—and 
of supplied anchors were mentioned earlier, pp. 400-404). 


IMPROVEMENT WITH PRACTICE 


Improvement in accuracy of judgment as a result of practice has been 
demonstrated in many situations. For example, the accuracy of a panel of 
housewives judging ground beef for the proportion of rancid beef included, 
improved from the first to the third week of training, as measured by the 
variance of the judgments and the slope of the regres ion line (Bennett, 
Spahr, and Dodds, 1956), Accuracy could improve with learning of 
distribution of stimuli Cas suggested earlier), with identification of single 
stimuli, with mastery of the response of the 
response system to the distribution omi 
experiment on judgments of dist: 


Purdy, 1955) w 
transferred to a 


system, or with adjustment | 
of stimuli, The improvement 10 
ance in an open field (Gibson, Bergman, : 
as attributed to the development of a scale which could is 


different field. Eriksen (1958) looked into the value of a 


AREE er de = gain 
Correction procedure in judgments of sizes and brightnesses, but the ga" 
from corre x 


ction was rather small in terms of information transmitted. He 
concluded that “what S learns on this task js not to recognize individua 
e as such, but instead to establish a frame af E Uh the series ° 
stimuli aa whole [p. 358].” As these experiments are usually carried out, 
the subject can correct himself by observing whether he has used the cate 
pones uniformly, so correction from the expedimentit may not add much. 
tom R z e neem of physical magnitudes also can ‘improve accuracy 
of judgment of physical maon; ce 

Terborg (1966) ee aa CHow me _— : c optim 

i 7 : i point and raised the question of the op 
order for learning to Judge a series of 11 dots on a scale of 1 to 11. It seem? 
reasonable to expect that early presentation of the extreme stimuli would be 
helpful in defining the ends of the distribution and linking them to the en 
of the response scale. The matter is complicated, however, by the cautious 


ness of the subjects. Most subjects avoid extreme judgments at first, as Wever 
and Zener (1928) sug 5 II be 


ggested long ago, hence the early judgments W} 


and 


al 
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less accurate when extreme stimuli are presented first than when middle 
stimuli are presented first. Thus there are two opposing tendencies to be 
disentangled. Performance on the first series of stimuli was facilitated by 
an carly presentation of the middle stimuli because of the subjects’ guessing 
habits. Learning was facilitated by an early presentation of the extreme 
stimuli, and this order improved accuracy on later series. After five series the 
differences had vanished. As a check on the methodology, Terborg used three 
measures of accuracy—information transmitted, correlation between judg- 
ments and stimuli, and magnitude of error. Although all three measures 
agreed, it was the simple error measure that scemed to be most useful as a 
Measure of improvement in accuracy. 

Even anchoring effects are influenced by practice, especially rewarded 
practice, as Larimer and White (1964) discovered by manipulating judg- 
Ment conditions before the introduction of an anchor. As compared with a 
lts, a group that received a mone- 


group of subjects with no knowledge of resu 
t judgments before the 


tary reward for correct judgments made more correc 
introduction of the anchor and were less influenced by the anchor. 

If this effect of monetary reward is a general one, the ambiguity of the 
subject's task in judgment experiments, especially when the series is shifted, 
could be cleared up by rewarding shift to the new series or adherence to 
the old; the interpretation of contrast effects and such phenomena would be 
clarified also. 


Confidence in Judgment 


ed after the judgment has been made, is 


Confidence in a judgment, report 
n right and also helpful 


a subjective dimension that is interesting in its ow 
in analyzing judging operations. Confidence can be reported in conversa- 
tional terms, such as very confident and quite certain, or on a rating scale. 
Most subjects will readily utilize a numerical or graphic scale from 0 to 100 
ecause it seems natural. They know the meaning of zero confidence or 4 
Pure guess and complete confidence or 100 percent certainty, and they use 
these points readily as end anchors. Distributions of confidence ratings are 
often J-shaped or U-shaped because the end points are used most frequently; 
ut average values are systematically related to stimulus dimensions, and 
Scores representing individual differences have satisfactory internal consis- 
tency CJohnson, 1939). 7 
‘ Psychophysical investigations occasionally require confidence ratings follow- 
Mg comparative judgments, and the data show that confidence increases 
With an increase in “the difference between the variable stimulus and the 
standard (Festinger, 1943; Johnson, 1939). Generalizing to judgments of 
Single stimuli, one would expect that confidence would be lowest at the 
oundary between any two categories, and it is apparent from Fig. 9.19 that 
this generalization holds for scales of two, three, and four categories (John- 


son, 1955). 
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FIGURE 9.19. Confidence and category boundaries. The stimuli were 11 cards h 
earing rows of 40 symbols: 10 to 30 x's and the remainder o's. The subject judged eac 
card as to preponderance of x's, in 1 of 2 categories (A), 3 categories (B), or + 
categories (C). Mean confidence is lowest at the category boundaries, shown at the 
bottom of each graph. From Johnson (1955). 
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psychophysical judgment in which confidence increases with the physical 
difference between two stimuli. But the amount of information was also a 
factor. After 20 pairs of cards had been seen, confidence reached averages of 
50, 59, and 69, respectively. 

Anyone who has studied statistics would ask next about the variability of 
the numbers. The experimenters raised this question also and found that, 
in general, confidence in deciding between the two packs decreased as the 
standard deviation of the difference increased. Similar results were obtained 
when they gave the subjects just one pack of cards, with positive and nega- 
tive numbers, asking whether the mean was greater or less than zero. With 
other factors held constant, the confidence of college students, like that 
of statisticians, increases as a function of the familiar statistic t (Little and 


Lintz, 1965). 


CONFIDENCE AND ACCURACY 
The old question about the relation between confidence and accuracy 
takes two forms. If we ask whether those who express their judgments 
with the most confidence are the most accurate, the answer comes from 
interindividual correlations, and these exhibit much variation depending on 
the materials judged. College students taking multiple-choice tests of 
Vocabulary in eight fields of knowledge and reporting their confidence in 
each choice showed correlations between confidence and accuracy of .13 to 
-45 for the seven academic fields and .76 for sports vocabulary (Johnson, 
1941). Words can be judged for spelling as well as meaning, and the same 
Correlations can be computed. In judging which of four versions of a word 
was correctly spelled, the correlations between confidence and accuracy were 
above .60 CAdams and Adams, 1961). 
he question about confidence and accuracy can also be posec for a 
Single judge. Will he be more accurate in those judgments that he makes 
with more confidence? In general, a change in the materials or conditions 
of judgment that increases accuracy increases confidence. In the comparauve 
Judgment, confidence and accuracy are related (as noted above) because 
both are related to the stimulus difference. A learning experiment can be 
arranged so that confidence and accuracy rise in parallel (Johnson and 
hoades, 1941). Thus it frequently happens that those judgments given 
With the most confidence are the most accurate CObrink, 1948). 
to homogeneous objects, but the same question 
arises for heterogeneous objects of judgment. Will the judge be most con- 
fident in the feld he knows best? This question was investigated by intra- 
individual correlations between confidence and accuracy on eight vocabulary 
tests (Johnson, 1941), and a mean correlation of .46 was obtained. In gen- 
ral, the students were most confident on the tests on which they got their 
est scores, but there was considerable variability. Kelly and Fiske (1951), 
ter Studying the prediction of performance in clinical psychology from 
"eterogeneous data, demonstrated that ng of judgment cannot be 
e d j 
quite - a guide to correctness of rag T i inn for accuracy 5 
raightforward but the motivation for an expression of confidence, like 


These statements apply 
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level of aspiration, is more complicated and the outcome may be unrealistic 
in some situations. 


REALISM OF CONFIDENCE 


The confidence scale is anchored at 0 and 100, but the other numbers 
are not often given explicit meaning. Adams and Adams (1958), however, 
have drawn attention to the ideal relation in which judgments made with 
100 percent confidence would be 100 percent correct, judgments made with 
50 percent confidence would be 50 percent correct, and so on. They ex- 
plained this ideal relation to their subjects and then had them judge word 
meanings and report confidence after each judgment. The discrepancies 
from the ideal were large at first, but those subjects who were shown their 
percentage correct at each confidence level made significant improvement 
in a realistic use of the scale over five days of practice. A later experiment 
(Adams and Adams, 1961) reported some evidence of transfer from judg- 
ing dots and weights to judging word meanings, indicating general improve 
ment in realism of confidence. This research is another illustration of how 


a judge can shift his scale of judgment to correspond with varying task 
requirements. 


CONFIDENCE IN EXPRESSING OPINIONS 


Allport and Hartman (1925), in one of the earliest attempts to measure at 
titudes, collected opinions on controversial topics and confidence in mere 
opinions. Their results indicated that those who held either extremely 
radical or extremely conservative opinions were more confident, and they 
attributed the extremist’s overconfidence to compensation for a feeling ° 
insecurity in his position. 

Similar results were obtained later from students with extremely antiwar 
and anticensorship opinions (Johnson, 1940), but the interpretation was based 
on principles of judgment rather than compensation. Fig. 9.20 shows as 
student's confidence in acceptance and rejection of 84 statements about wal 
apparently the subject divided the attitude continuum into three categories, 


or subranges, or, in the terms of Sherif and Hovland (sce page 435), latitudes 
of acceptance and rejection. Confidence w. i il 
boundaries, according to 
distance from a category 
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as minimal at the two cates? h 
a general principle of judgment, and increased wit 
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ian ; boundary. Thus the most confident judgments wert 
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general appear more confident. 


reject with high confidence and will 


Judgment Time 


ie . s 
Under some conditions, as when a standard stimulus and a variable stimulus 
are exposed simultaneously or when single stimuli are judged, judgme? 
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FIGURE 9.20. Confidence and extremeness of attitude. The subject judged 84 
Statements about war and reported his confidence in each judgment. The 
highest confidence comes from rejection of statements far from a category boundary. 


Data from Johnson (1940). 


recorded. Thus judgment times are formally similar 
to reaction times, but typically they are considerably longer since the tasks 
are more difficult and accuracy rather than speed is emphasized. It has 
long been known that in the comparative judgment, the judgment time 
increases as the difference between the variable stimulus and the standard 
decreases (Cartwright, 1941; Festinger, 1943; Reed, 1951). The curve rises 
toward a peak as stimulus difference decreases and would apparently reach 
infinity at zero stimulus difference, but actually all published curves round 
off before reaching that point. People do not deliberate forever, even when 
two stimuli appear exactly equal. They make a choice after a while, even 
if only an arbitrary one. If we approach this phenomenon by asking, as in the 
Preceding chapter, about the utility of the decision, it is reasonable to as- 
sume that the decision maker will terminate his struggle, one way or an- 
other, when the effort exceeds the expected utility of the outcome. A quanti- 
tative model built around this assumption can be applied to a variety of 
diffcult decision situations (Pollay, 1970). 

When judgments are made by the method of single stimuli, there are 
Several category boundaries. The generalization can then be made that 
judgment time decreases as a function of distance from a category boundary 
Csee Johnson, 1955). Fig. 9.21, taken from Cartwright (1941), displays 
the two peaks of judgment time at the two boundaries of 3-category scale. 
It is not surprising that when the number of scale categories is increased, from 
2 to 64, there is a regular increase in overall judgment time (Bevan and 
Avant, 1968). 

The same relation between judgment time and category boundary was 
demonstrated in a very different experiment by Cohen, Hansel, and Walker 
C1960). They asked twelve-year-old children to attempt to hit a ball through 


time or latency is easily 
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FIGURE 9.21. Judgment time for recognition. Five angles, from 60° to 100°, were 
presented for learning; then 16 angles, 10° to 160°, were presented for recognition. 
Tabulation of the judgments permitted the inference that the subject had divided the 
series of angles into three categories, the boundaries of which were computed as 60° 

and 105°. Judgment time was maximal at these two boundaries. From Cartwright (1941). 


gaps of various widths. One group was to estimate probability of success 
for each width on a scale from 0 to 1.0; another group of children had to 
make a two-category decision whether to try or not, and the decision time 
for this group was maximal at the boundary between anticipated success an 
failure, that is, at the width that had a subjective probability of 0.5. 

In the expanded judgment situation described a few pages back, it was 
noted that confidence in judgments about numbers in a pack p” cards depends 
in part on the amount of information, that is, on the number of cards 
reg same task can be used to study the time spent in arriving at 4 
denii ae n ng allowed to proceed at his own pace, making” 
F : whenever he is ready, and the number of cards required is t 
to a Tvin and Smith (1956) gave their subjects one paek 

positive and negative numbers and asked them to look a 
one card at a time and decide as soon as they could whether the averas® 
was greater or less than zero. On this task also, college students were reason- 


ably good intuitive statisticians because the number of cards requite 
decreased with the deviation from the zero condition and increased with the 
variability. 

It is instructive to apply this t 


. i of 
aos ype of analysis, looking at the distance 
stimuli from category boundaries, i 


$ À : to experiments in which the subjects choos® 
their own categories. Tajfel and Bruner (1966) asked children aged eleven 


and twelve to decide whether each of a series of 44 lines was five inches 
Jong or not. They then divided the children into a group of broad catego” 
izers, who said yes to 34-39 lines; a group of medium categorizers, who 5% 
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yes to 22-33 lines; and a group of narrow categorizers, who said yes to 14-21 
lines. The principal finding was that the medium categorizers made their 
judgments more slowly, on the average, than the other two groups. 

Although category boundaries were not reported, it is a reasonable con- 
jecture that all subjects divided the series of 44 lines into 3 categories: 
less than 5 inches, to which the response would be no; about 5 inches, to 
which the response would be yes; and more than 5 inches, to which the 
response would be no. The broad categorizers, according to the present 
analysis, would make their yes responses faster than their no responses be- 
cause some of the lines judged yes were in the middle of the large middle 
category, far from either category boundary, while the few lines judged no 
were close to one or the other boundary. The narrow categorizers would 
make their yes responses slower than their no responses because all lines 
within the narrow middle category were close to a category boundary; fast 
no responses could be judgments of extremely short and extremely long 
lines. The data reported agree with these speculations. It follows also that 
the judgment times of the medium categorizers would be generally slow 
because they could find few lines that were far from a category boundary. 

In general, it is well established that judgment time decreases and con- 
fidence of judgment increases with distance from a category boundary. 
Regardless of the stimuli being judged, the two dependent variables are 
understandably related. On the confidence scale from 0 to 100, as confidence 
increases (or doubt decreases) arithmetically, judgment time decreases 
geometrically (Johnson, 1939). 


Individual Differences in Judgment 


The variations between individuals in the activities discussed in this 
chapter, usually obscured by the use of averages, are often striking, and some 
of them have been studied directly (Johnson, 1955, pp. 382-384). Individual 
differences in confidence and speed of judgment can be measured with good 
reliability, and these traits exhibit some generality over variations in the 
objects of judgment. In the special case of judgment of attitude statements, 
individual differences in confidence depend, as noted above, on the subject's 
own attitude. When children and adults make estimates of various objects 
On familiar scales, the width of the categories used can be measured with 
800d reliability; this trait also exhibits some generality and can be related 
to other characteristics of the judges (Bruner and Tajfel, 1961; Pettigrew, 
1958; Tajfel and Bruner, 1966). Extremity of judgment—the tendency to 
Use the end categories of a scale rather than the middle—is also a rather 
Seneral trait, but its relation with personality variables is not clear (Warr 
and Coffman, 1970). 

The shift experiment, which requires the subject to judge one series of 
Stimuli, then another, on the same scale of judgment, also discloses large 
Variations between individuals in the rate of adjusting to the new series. A 
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persons. Sieber and Lanzetta (1966) discovered that cognitively complex 
persons also require more information before making decisions about ambigu- 
ous pictures and express greater uncertainty about their decisions. A hypoth- 
esis developed from the information-processing approach states that the complex 
persons have more associations to the stimuli and carry on more differentiation 
and encoding. This hypothesis was supported when training in the mediating 
cognitive activities made the judging processes of the simple subjects more 
like those of the complex subjects. 

We must conclude that research on individual differences in judging 
activities, as in problem-solving methods and styles, has been characterized 
more by versatility of theoretical interpretation than by uniformity of results. 
Clear outlines have not yet emerged, but occasional flashes of illumination, 
from different quarters, continue to encourage hope for more consistent en- 


lightenment. 


Major Trends and Implications 


Thought and communication are carried forward, not only by putting 

some stimulus objects rather than others into various unrelated categories, but 
also by putting all the objects under consideration into categories ordered 
along a scale. The stimulus objects are observable, the names of the categories 
constitute a system of observable responses, and it is conceivable that the 
judging process consists of direct association of specific responses with spe- 
cific stimuli; but no one seems to argue for this. Most investigators seem to 
assume—though the explicitness of this assumption varies—that the scale 
of judgment is a construct, mediating between the stimuli and the responses, 
and the quantitative precision of many judgment experiments makes this 
construct more convincing than most. Boundaries between categories, context 
effects, the central tendency of judgment, anchor effects, and variations of 
speed and confidence of judgment near category boundaries are all properties 
of scales of judgment, not of the stimulus series or the response system. 
Scales of judgment are cognitive constructs, or schemas, in that the objects 
of the world are known, discussed, and evaluated in reference to scales. It 
is unfortunate that research on cognitive consistency and research on scales 
of judgment have proceeded independently, but some steps toward synthesis 
are being taken. 
. The question of the organization of a scale of judgment is one of ad- 
Justing the response system given by instructions to the series of stimuli. 
Scales are likely to be organized from the middle outward when there is a 
standard or salient anchor at the middle of the series and from the ends in- 
ward when the ends are salient. When no stimulus or response category is 
salient, the scale is not anchored at any special point and variability is uni- 
form throughout the series. 

When a scale of judgment is adjusted to a series of physical stimuli or 
attitude statements and then the series is shifted, the scale is usually realigned. 
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But the amount of realignment varies considerably, depending on the response 
system, the instructions, the amount and recency of experiences with the 
old series, the nature of the objects of judgment, their range, and the 
presence of anchor stimuli. The question whether the shift in jidgment 
of an object constitutes a “genuine” shift in perception or a change in F 
titude probably cannot be clearly answered by recording only one kind o 
response; convergence or divergence of inferences from different measures 
will be more enlightening. The importance of the subject's approach to the 
task is documented by the results of complex experiments in which attention 
is manipulated, either by instructions or by variation of relations between 
stimulus dimensions. Under some conditions, perhaps, the subject's response 
may be treated as a direct measure of the effects of a stimulus, but this must 
be a rare situation. As judgment experiments become more complex, they 
resemble experiments on concept achieveme 
the subject's understanding of his t 
meaningful today to talk about strategies of judgment. N 

In psychophysical and social judgment, contrast effects are common. / 
stimulus may be judged large in contrast to one series and small in contrast 
to another or to an extreme anchor stimulus. Assimilation, however, is not 
common and, when it occurs, not well understood. f 

The well-known theory of adaptation level (AL) has stimulated much T 
the research reviewed in this chapter, has been applied to a variety ne 
experimental Situations, and seems to account, in a rough way, for much pi 
the data. The criticisms revolve around the assumption of a logarithmic moe 
tion, the idea of a level to which other stimuli are referred, the weight ki 
anchor stimuli, and ad hoc statistical adjustments. The most enduring uE 
of AL theory is probably the pooling feature, the hypothesis that the effects 
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predict single judgments. 


hen judgments are linearly 


nt and problem solving, so that 
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related to stimuli, computation of a 
conventional correlation coefficients and regression equations accounts tOr 
scale values of stimuli, including the effects of external anchors and of the 
central tendency of judgment. Th 
attention to the variability of the si 
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mediate statements ambiguously written. Some contrast due to order of presen- 
tation has also been found, but the large context and contrast effects typical 
of perceptual judgments do not appear in these judgments of easily remem- 
bered social objects. 

Studies of the judge’s own attitude indicate that each judge divides the 
attitude continuum into a range of opinions that he rejects as too anti, a 
range that he accepts, and a range that he rejects as too pro. Judgments of 
opinions in the transition regions are variable and uncertain. If the judge’s 
opinion is outside the range of opinions presented, he extends the scale 
to include his own position. These effects of the range of opinion and of 
the judge's attitude illuminate the psychology of judgment and help to 
explain some of the effects found in perceptual judgment, but they do not 
negate the fact that high linear correlations are usually reported between 
scale values from different methods and different groups of judges. 

Most judgment research has unfortunately been limited to accounting for 
mean scale values rather than single judgments, but it is evident that the 
accuracy of the single judgments is increased somewhat by increasing the 
number of scale categories, by adding anchors, and of course by practice. 
The judge learns the distribution of stimuli and adjusts his scale to this 
distribution, rewarded probably by noting observation of consistency in 
his judgments. Since the judge’s task in many judgment experiments is 
ambiguous, a better definition of the task and instructions emphasizing ac- 
curacy might clarify some of the ambiguous results. ; 

Confidence in a judgment varies up and down in predictable ways, being 
lowest when the difference between two stimuli to be discriminated is small, 
when a single stimulus or attitude statement to be categorized is near a 
boundary between two scale categories, and when the information for 
Categorization is, in a statistical sense, nonsignificant. Confidence is related, 
in one way or another, to accuracy, and this relation can be improved by 
Practice, so that the confidence reports become more realistic. 

By the same principles, judgment time is longest when the difference 
between two stimuli to be discriminated is small, and when a single stimulus 
to be categorized is near a category boundary. Data on the time spent by 
college students in collecting information for decisions, like the data on 
confidence, indicate that they are fairly good intuitive statisticians. These 
Principles relating to confidence and time around category boundaries 
offer rather simple explanations for some apparently complex phenomena of 
Judgment. : 

These two chapters on judgment have integrated data on tasks as varied as 
brightness estimation, impression formation, and the scaling of attitude state- 
ments, and performance on these tasks is supposed to be related to perception, 
Personality, and social psychology. But looking naively, behavioristically, at the 
Subject’s activities we see him assigning various objects to categories. One may 
hope that these performances are indirectly dependent on other psychological 
functions, but the fact that the subjects are making judgments is undeniable. 
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